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METHODS FOR ANALYZING ANIMAL PPnnnr TS 

The present invention relates to methods for analysing animals and their products. In 
5 particular, the invention relates to methods for differentiating animal products on the 
basis of breed origin, determining or testing the breed origin of an animal product and 
for validating an animal product, as well as to kits for carrying out such methods. In 
addition, the present invention provides methods for the determination of pig genotype 
with respect to coat colour. 
10 . 

Introduction 

Animal breeds 

For thousands of years, selective pressure has been applied by humans in the course of 
15 animal husbandry to produce livestock exhibiting certain desirable characteristics. 
These characteristics have been selected to meet aesthetic, technical, ritual, social and 
economic needs. The result has been the production of a large number of different 
animal breeds. 

2 0 The term "breed" is a term of art used to define a homogenous, subspecific group of 
domestic livestock with definable and identifiable external characteristics that enable it 
to be separated by visual appraisal from other similarly defined groups within the same 
species. The term therefore defines a group of animals to which selective pressure has 
been applied by humans to give rise to a uniform appearance that is inheritable and 

2 5 distinctive with respect to other members of the species. 

As breeds become established, their integrity is maintained by breed societies, 
herdbooks and pedigree records. 

3 0 Breed selection 

Conventional breed selection methods are based on direct measurement of the 
phenotype of an animal and/or its relatives. Thus, the implementation of breeding 



• 



schemes requires extensive phenotypic record keeping. For example, dairy herd 
improvement programs in the United States and Western Europe relied in part on the 
collection of individual records (milk yield and composition, type traits, health traits, 
etc.) performed on a monthly basis for millions of cows. Likewise, breeding companies 
5 carefully monitor their pig and poultry breeding stock for a whole range of phenotypic 
measurements. 

However, some important characteristics are not immediately apparent at the level of 
the living animal. For example, many parameters of meat quality are determined by 
10 subtle physiological or biochemical characteristics which are not readily apparent and 
so cannot serve as the basis for efficient artificial selection. 

Breeding for qualities of this type has relied in part upon selection for other (more 
readily apparent traits) which are to some extent coinherited (linked or associated) with 
15 the desirable characteristics. For example, in the pig industry lop ears have in the past 
been associated with mothering ability and so have been used as a marker for this trait. 

Conventional breed selection methods are limited by the fact that some phenotypes are 
expressed only in one sex or at a specific developmental stage. Moreover, some 

20 phenotypes are difficult and costly to measure. Indirect detection of such phenotypic 
traits via DNA-based diagnosis (for use in marker-assisted selection or MAS) is 
therefore seen as a desirable^aitemative toUirect measurement of phenotypic parameters 
(see Georges and Andersson (1996), Livestock genomics comes of age, Genome 
Research, Vol. 6: 907-921). However, the gene structure-function relationships 

25 underlying many of the desirable traits are often highly complex and not yet sufficiently 
well-established to make such an approach feasible in practice. 

Breed identification 

The definition of animal breeds is currently at a watershed. Whereas previously they 
3 0 have been defined by overt physical characteristics and pedigree records, in the future 
as new breeds are developed from specific breeding lines they will be defined by sets of 
DNA markers. The work described herein allows not only the most accurate approach 



to breed determination currently possible in a range of products but also allows the 
integration in a common format of breed determinant information obtained through use 
of the present invention with that which will be used in the future. The present 
invention therefore allows not only the determination of source breed in the current 
5 environment but also links this to the development of future breeds and their unique 
identification. 

It is generally recognized that the only definitive way to identify a particular animal as a 
representative of a given breed is through its pedigree. Thus, despite the fundamental 

10 importance of overt phenotypic traits in the breeding process and in the maintenance of 
breed purity, those skilled in the art generally consider that breed identity cannot be 
definitively characterized on the basis of visual inspection of such traits. By way of 
example, the genetic factor causing the belt phenotype in pigs is dominant to the non- 
belted form. Thus, a belted animal may result from an animal of a belted breed such as 

1 5 Hampshire being crossed with a non-belted breed. 

As stated in PIGS A handbook to breeds of the world, V. porter, Helm Inf. Ltd, ISBN 1- 
873403-17-8, 1993, page 16, "What is a Breed?": 

2 0 Appearances can be deceptive: never judge a pig breed by its coat! 

However, in many circumstances breed identification on the basis of direct evidence of 
pedigrpe is difficult or impossible. Thus, in practice, so-called "breed markers" may be 
used to determine breed identity. 

25 

The term "breed marker" is a term of art which defines a measurable characteristic 
which on the basis of empirical data appears to be breed specific. Breed markers include 
genotypic features such as DNA polymorphisms, chemical features such as protein and 
water contents of meats, epigenetic/biochemical features (such as protein 

3 0 polymorphisms), chromosome structure, gene copy number, DNA fingerprinting, 

microsatellite analysis and RAPD DNA markers. 




4 

Other useful markers include breed determinants. The term "breed determinant" is vised 
herein to indicate an overt phenotypic characteristic which is used (at least in part) as 
the basis of artificial selection during breeding programmes. It is used in 
contradistinction to the term "breed marker", which (as explained above) is used herein 
5 to define other characteristics which appear to be breed specific on the basis of 

empirical data. The term "breed determinant gene" is used to indicate a gene which is 

■ • -. 

involved (at least in part) in the expression of the corresponding overt phenotypic 
characteristic. 

1 0 Some breed determinants (e.g. coat colour) have traditionally been used as breed 
"trademarks", and so have long served as an indication of pedigree (and breed identity). 
Other breed determinants that have also been selected for in breed development include 
features such as ear carriage, face shape and general anatomical conformation. The 
advantage of breed determinants relative to simple breed markers is the inseverable link 

1 5 between the characteristics of the breed and the determinant. 

Biochemical and genetic tests for breed identity • 

Many of the breed markers discussed above can be characterized using biochemical or 
genetic tests. Such markers include genotypic features (e.g. DNA polymorphisms), 
2 0 biochemical features (e.g. protein polymorphisms), chromosome structure, gene copy 
number, DNA fingerprints, microsatellite patterns and RAPD DNA markers. 

However, there are significant problems associated with such tests, as discussed below: 

25 Tests based on the chemical composition of animal products (e.g. meat or seminal 
plasma) may be compromised by the fact that the chemical profile varies between sites 
in the animal (ie different muscles)and is affected by diet, age, sex and sample storage 
conditions. Moreover, the results obtained are usually quantitative in nature, leading to 
problems with interpretation and comparison between different test sites. 

30 

Tests based on protein polymorphisms are limited by the fact that the distribution of any 
given protein is unlikely to be uniform, so that the protein of interest is absent in certain 



tissues. Thus, a number of different polymorphism markers may be required to check 
all products of interest for breed provenance. Moreover, such tests are based on 
antibody assays, and a significant investment is also required to develop the reagents for 
a specific antibody test. 

5 

Chromosome structure analyses are compromised by the high level of skill required for 
cytogenetical methodology and interpretation and the elaborate precautions and care 
required for sample preservation. Such markers are poorly applicable on anything but 
materials derived from living or newly deceased animals. 

10 

Classical DNA fingerprinting is based upon regions of repeated DNA sequence that due 
to their structure show a large degree of variation in length within a population. Such 
regions are often present in a number of copies within the DNA of an individual, thus 
increasing the potential for individual variation. By separating fragments of the total 
15 DNA according to size and then defining the position (and so size) of the hypervariable 
region using a specific probe, a fingerprint of a series of bands for a particular 
individual can be obtained. A number of probes for hypervariable regions of DNA have 
been examined in pigs (including Ml 3 viral sequences and human minisatellite probes) 
and it is claimed that specific bands were found in each breed. 

20 

Random Amplified Polymorphic DNA (RAPD) markers are based upon PCR 
amplification of DNA fragments using primers of random sequence. Such reactions 
generally give rise to a number of DNA fragments which can be characterised 
according to size by gel electrophoresis. If the products of reactions based upon DNA 

25 from different breeds are examined there is the possibility of finding certain DNA bands 
which are breed specific. However, there is in most cases no direct link between the 
alleles of such repeat series present and the features determining the actual nature of the 
breed. This, combined with the hypervariable nature of these regions of DNA, results 
in them rarely being breed specific (similar alleles being found in a number of different 

3 0 breeds). As there is no link to the phenotype of the breed there is a greater risk that 
cross specific alleles could exist or arise in a breed, whereas this is unlikely with breed 
determinants as they define the phenotype itself. Given the large number of populations 



of animals of specific breeds that exist, extensive research would have to be carried out 
to exclude a DNA marker from breeds other than that with which it is claimed to be 
linked. 

5 However, a major drawback with this approach is that RAPD markers are considered to 

be unreliable and found to be subject to variation between laboratories. Such problems 

• * 

are exacerbated when samples of different types and history must be analyzed and 
compared. 

10 There is therefore a need for reliable breed markers which can be used as the basis for 
rapid and inexpensive methods for identifying the breed provenance of various animal 
products and for validating animal products (such as foodstuffs and semen for use in 
breeding programmes). 

15 It has now been recognized that breed determinants as hereinbefore defined (such as 
coat colour) have unexpected advantages as breed identifiers or breed specific markers. 
In particular, it has surprisingly been discovered that the use of overt phenotypic 
characteristics as the basis for selection over long periods of time has led to particular 
alleles becoming fixed in most breeds. Such breed markers can be used to provide 

2 0 industry standard profiles for a particular breed that has application to all materials 

derived from a particular species. 

Thus, jt has now been found that many breeds are in fact genetically homogenous with 
respect to breed determinant genes (as hereinbefore defined), so that these genes may 
25 serve as the basis of reliable breed-specific markers (contrary to the prejudice in the art 
mentioned earlier regarding the utility of breed determinants per se, such as coat colour, 
in breed identification). 

Moreover, it has surprisingly been found that the nature of the breed determinant genes 

3 0 (or alleles thereof) underlying any one breed determinant (such as coat colour) may be 

highly polymorphic. Thus, variation in breed determinant genes and/or alleles between 
different breeds may exist, notwithstanding the fact that the different breed determinant 



genes/alleles may contribute to the expression of the same overt phenotypic 
characteristic. 

Prior to the present invention, it was assumed that the corresponding genetic 
5 determinants would be insufficiently polymorphic to provide a useful basis for 
distinguishing between breeds. For example, coat colour was known to be shared 
among different breeds of pig and (as mentioned above) was therefore not regarded as a 
good candidate for a breed specific marker. However, the present inventors have found 
that the alleles underlying the coat phenotype in such breeds are in fact highly 
10 polymorphic and often distinctive (and so useful as the basis for breed identification). 

Similar considerations apply to other overt physical traits (breed determinants), which 
may therefore be shared by different breeds while nevertheless associated with distinct 
genes/alleles in each breed. An example of this is seen in cattle exhibiting the double 
15 muscled phenotype. Work by Kambadur et alia (1997, Genome Research 7, 910-915) 
and Grobet et alia (1997, Nature Genetics 17, 71-74) illustrates that the double muscled 
phenotype of cattle is caused by mutations in the myostatin gene. However, in the 
Belgian Blue and Asturiana breeds, this gene contains an 1 1 bp deletion whereas in the 
Piedmontese breed a G to A transition is present. Thus, as with porcine coat colour a 

2 0 single selected characteristic is caused by a number of potential polymorphisms. 

However, the nature of the arisal and selection history for such overt physical 
characteristics leads to the fixation of particular alleles within the breeds contributing to 
the breed specific profile of determinants. 

25 In the light of these findings, it has now been recognized that genetic analysis of breed 
determinants (such as coat colour) provides an effective means for validating animal 
products (e.g. foodstuffs) and may advantageously be incorporated into animal product 
(e.g. food) processing lines to monitor and maintain product quality and into quality 
control protocols in the food industry. 

3 0 

Coat colour 

Pig breeds show a variety of coat colours and these are often associated with 



particular production characteristics. For example, white is the predominant coat 
colour among European commercial breeds e.g. Large White and Landrace, and these 
breeds are associated with larger litters and good mothering ability. However, there 
are a number of commercially important coloured breeds, demonstrating a number of 
5 colours and combinations. The Duroc, associated with meat tenderness, is red, the 
Pietrain, a heavily muscled animal which produces a very lean carcass, is spotted, and 
the Hampshire, also heavily muscled, is black with a white saddle over its shoulders. 
In addition, there may be other useful local breeds which have traits of potential 
commercial interest, and which are coloured. For example, the Chinese Meishan 
10 breed has been imported into Europe and the US because of its very large litter size. 
The European Wild Boar is brown when adult and striped when juvenile, and this 
breed is utilised to satisfy consumer demand for traditional meat products. It is also 
claimed that other local breeds or landraces are important because of their adaptation 
to local environments, e.g. temperature, endemic diseases and local feedstufFs. 

15 

Coat colour is important to the pig industry for a number of reasons. Firstly, gross 
variation in appearance (i.e. a range of coat colours) of pigs claimed to be genetically 
consistent for traits other than coat colour can lead to questions about the consistency 
and quality of the animals in the mind of pig-producers. Thus, the coat colour of the 

2 0 pig is often used as a trademark of the breed and the breeders want to ensure that their 

animals breed true for colour. For example, in several markets, local, traditional, 
coloured breeds are marketed for their meat quality or in terms of the production 
system used to rear them. However, this is not a trivial task since the coat colour is 
controlled by a number of genes. The inheritance is also complicated by the presence 
25 of dominance and interaction between genes. There is also an application in the 
assessment of the purity of the genetics of traditional breeds used as the basis for 
modem synthetic lines and the confirmation of the derivation of the latter. 

Secondly, in a number of markets there is a preference for white skinned meat. This 

3 0 is due to the fact that pork is often marketed with the skin still attached, and skin from 

coloured pigs, even if dehaired, can still exhibit colour, which can lead to negative 
perception by the consumer partly, since the surface of the meat may appear to be 



spotted by mould. It is therefore necessary in these markets to remove the skin from 
such carcasses, entailing additional cost. For example, in the US, coloured carcasses 
are associated with approximately 1% skin defects requiring dehairing and skinning to 
remove pigment. As a result of this, coloured pig carcasses are generally discounted. 

5 

One example of the problem concerns the presence of black pigmented spots 
occurring in production animals that are crossbreds between a white and a pigmented 
line. This may occur because the dominant white gene inherited from the white breed 
is not always fully dominant in the heterozygous condition which occurs in this cross. 

1 0 A possible solution to this problem would be to ensure that the production animals 
are homozygous for the recessive red allele present in breeds such as the Duroc. In 
this case the pigmented spots would be red instead of black and much less 
conspicuous. To achieve this one needs to breed the recessive red allele to 
homozygosity in both the white and pigmented line used for cross-breeding. 

15 However, this would be very difficult using phenotypic selection as selection for a red 
background colour in a white line could only be accomplished with very expensive 
progeny testing schemes. 

In addition, pig breeders would like to be able to be in a position to ensure consistency 

2 0 in breeding populations. Breeders may wish to ensure that progeny produced by 

breeding crosses were always white. Alternatively, a breeder of Duroc or Hampshire 
pigs may wish to ensure that breeding crosses always produced the characteristic 
Duroc*or Hampshire colouring. Traditional animal breeding practices have in the 
past, been used to attempt to eliminate untypical colour from pig lines. For example 
25 purebred breeders must submit potential boars for progeny testing in order to 

demonstrate that they are suitable for inclusion in the breeding herd. This procedure 
incurs significant cost, including the substantial delay to confirm sufficient matings 
and progeny have been produced before the animal can be used commercially. 

3 0 Therefore, selection based on a diagnostic DNA test for mutations in coat colour 

genes would be a major advance compared with phenotypic selection. 

Coat colour is determined by the action of a number of different gene loci. For 



10 

example, the gene determining whether a pig is white or coloured is designated / (for 
inhibition of coat colour). The version of the gene preventing the expression of any 
colour (7) is dominant to that which allows colour to develop (i). Traditional selection 
for white animals has reduced the frequency of /, but it still remains in the population of 
5 white heterozygous carrier animals. Recently, a number of structural differences in the 
alleles of the ATT gene were identified and found to be involved with this aspect of coat 
colour determination which allowed the development of methods of distinguishing 
between alleles at this locus. 

10 However, animals which carry two copies of the recessive allele, f, at this locus have 
non- white coat colours (Johansson-Moller et al, Mamm, Genome^ 7:822-830 (1996), 
WO-A-97/05278, the disclosure of which is incorporated herein by reference). Pigs of 
this type can be all one colour, such as the Duroc (which is red), or have combinations 
of colours (particularly spotted or striped or banded patterns, such as the Pietrain and 

15 Hampshire, respectively). Many other combinations are possible and are observed (see 
the table, below): 

Genotype Colour 

VI White 

20 I/i White 

i/i . Coloured 

F/F White with coloured spots 

f/I White 

F/i White with coloured patches 



25 



The non- white colour in such animals may be varying shades of red or black. The type 
of colour expressed is determined by the action of a second gene which is designated E 
(for extension of coat colour). Based on the literature, animals which contain the E 
version of the gene are completely black, and this version of the gene is dominant to 
3 0 that which results in red coat colour (e). Patched or spotted animals, such as the 
Pietrain breed, contain a third version of the gene designated 2F\ This version of the 
gene is dominant to e but not to E. For example, black animals may have the 



genotype, UEe 9 UEEF or UEE. A black sow and a black boar which were both 
heterozygous at the E locus and which were of the genotype UEe would produce both 
black and red piglets in the ratio 3:1. The black piglets would be UEE or UEe and the 
red piglets would be iiee. 

5 

The density and coverage of coat colour and the position of bands of white are 
determined by additional loci, one of which, the belt locus, is discussed later in this 
application. For example, the Hampshire breed is background black with a white band 
across its shoulders, the width of this band may vary, however, the colour should be 

10 black. There is evidence that some Hampshire animals are derived from herds that 
have been crossed previously with red breeds, such as the Duroc. In this situation, the 
red version of the gene can be maintained silently in the heterozygous state. When 
two heterozygotes are crossed 25% of the offspring will contain red. In some cases 
such pigs will have the appearance of the Duroc breed, being solid red, however, in 

15 other cases, the animals will have the white band inherited from the Hampshire and 
have the appearance of red Hampshires. It is the presence of the atypical coat colour 
rather than the pattern that is important in this situation. 

The extension locus is known in other breeds of domestic animals, such as the horse, 
20 where e is associated with chestnut colour (Adalsteinsson, J. Hered. 65:15-20(1974)), 
cattle (Klungland et al, Mammalian Genome 6: 636-639 (1995)), the fox 
(Adalsteinsson, J. Hered, 78:15-26 (1987)) and the mouse (Jackson, Ann Rev Genet. 28: 
1 89-2 1 7 (1 994)). The extension locus encodes the alpha melanocyte-stimulating 
hormone receptor (otMSHR). It has been shown that recessive alleles at this locus do 
25 not express a functional aMSH receptor (Robbiris et al, Cell, 72: 827-834, Klungland 
et al, Mammalian Genome 6: 636-639 (1995)) and these workers have identified 
mutations in the sequence of the aMSHR gene in these species associated with 
different coat colours. 

3 0 Classical segregation analyses have identified a minimum of three alleles at the pig 
extension locus: E for uniform black, E? for black spotting and e for uniform red 
(Ollivier and Sellier, Ann. Genet. Sel.Anim., 14:481-544, (1982)). The dominance 



relationship among the three alleles is as follows E>EF>e. We have now found that 
these coat colour variations are associated with sequence polymorphism in the 
ctMSHR gene in the pig. We have analysed the DNA sequence of this gene using 
samples from the following breeds with different coat colour: Wild Boar which is 
5 wild type coloured, Meishan and Hampshire which carry alleles for uniform black 
(£), Pietrain and Large White which carry alleles for black spotting (EF) and Duroc 
which is uniform red (e). In Large White the patches or spots of colour that might be 
expected due to the presence of the EF allele are hidden as this breed also carries the 
dominant white gene which prevents any expression of colour. Five different ctMSHR 

1 0 sequences were obtained one from the Wild Boar, one from Meishan, one from Duroc 
one from Hampshire, and one found in Pietrain and Large White. We have designated 
the allele found in the Wild Boar as E* and assume that the presence of this allele is 
necessary for the expression of the wild type colour. The £ alleles for uniform black 
carried by Meishan and Hampshire pigs were associated with different ctMSHR 

15 sequences. We have denoted these two alleles ET and respectively. The DNA 
sequence associated with the allele for black spotting found in the Pietrain was 
denoted^. The similarity of the EF and alleles suggests that they are derived from 
a common origin. The sequence differences presented here can be used as the basis of 
methods and kits to determine the genotype of pigs in relation to coat colour. 

20 Alternatively, alleles of linked markers, such as microsatellite or AFLP markers, 

found to be in linkage disequilibrium with these alleles could be used to predict colour 
genotype. In conclusion we have found fivedifferent ctMSHR sequences associated 
with five different extension alleles i.e. E*, EF* y EF and e. 

25 Except for the 2 base pair insertion at the 5' end of the EF allele and the 1 bp deletion 
in the 3' untranslated region of ET allele the DNA sequence differences identified in 
the ctMSHR gene are single base pair changes. Some of these are silent, however, a 
number lead to changes in the amino acid sequence of the ctMSHR protein. For 
example, the differences between e and the other alleles are two missense mutations 

3 0 in the coding sequence of the ctMSHR gene. Importantly, the differences in the pig 
gene are different from that found in other species. The cattle and mouse e mutations 
are one base pair deletions (Robbins et aU Cell, 72: 827-834 (1993); Klungland et al, 



Mammalian Genome 6: 636-639 (1995), Joerg et al, Genome 7: 317-318 (1996)), 
whilst the mutations identified here include a missense mutation (G727 changed to A) 
in a region which is conserved among human, mouse, cattle and horse gene sequences 
(Wikberg et al, WO 94/04674 (1994), Valverde et al, Nature Genet. 11: 328-330 
5 (1995), Robbins et al, Cell, 72: 827-834 (1993), , Klungland et al, Mammalian 
Genome 6: 636-639 (1995), Joerg et al, Genome 7: 317-318 (1996), Marklund et al, 
Mamm. Genome, 7:895-899 (1996)). The E? group has a dinucleotide insertion in the 
5' end of the gene after nucleotide position 66 of the Wild Boar sequence which leads 
to the creation of a stop codon further into the gene resulting in a predicted mutant 
10 polypeptide of only 54 amino acids. Finally, the Meishan allele {ET) shows four 
amino acid changes in the protein. Two of these differences are in the same region of 
the gene which is altered in cattle. 

The colours of a series of pig breeds, the classical genotypes for / and E and the 
15 determined genotypes for E based on sequencing and testing studies are shown in the 
table below: 



Breed 


Hocus 


E locus 


Colour 


Hampshire 


i/i 


EF/EF 


Black with white belt 


2 0 Large White 


VI 


EF/EF 


White 


Landrace 


VI 


EF/EF 


White 


Pietrain 


i/i 


EF/EF 


White with black patches 


Berkshire 


i/i 


EF/EF 


Black with white points 


Meishan 


i/i 


ET/ET 


Black or black with white points 


25 Duroc 


i/i 


e/e 


Red 


Wild Boar 


i/i 


E?/E? 


Brown (banded hair) 



3 0 Thus, it is possible to distinguish between the alleles of E*, IF 1 , EF and e and so 
determine the genotype of individual pigs (or the genetic provenance of products 
derived therefrom) with respect to non-white coat colour. Interestingly, the white 



14 

breeds that have been examined all appear to be fixed for alleles EF at the E locus. 
There is considered to be potentially some modifying effect of the E locus on the 
phenotype conferred by the / locus. While the basis of this is not established, the fixing 
of E? in these lines illustrates the subtle effects on loci involved with coat colour upon 
5 selection for breed characteristics thus providing more determinants among such loci 
than might be expected. 

Associations can be determined between extension locus genotype and linked markers, 
eg microsatellite sequences which are linked to the gene. A number of microsatellite 
10 markers have been located to the region of porcine chromosome 6 to which the 
aMSHR gene has been mapped. 

A number of pig breeds characteristically show the belt phenotype consisting of a 
continuous white belt over the shoulders and white fore legs. Examples of breeds 

15 demonstrating this characteristic are the British Saddleback (derived from the Wessex 
and Essex breeds) and the Hampshire, which show a white belt upon a black back 
ground and the Bavarian Landschwein characterised by a white belt upon a red 
background. The characteristic is controlled by a dominantly acting locus Belt 
designated Be for which there are thought to be two alleles (Legault 1997 in The 

20 Genetics of the Pig, Ed Rothschild M.F. and Ruvinsky A, PubL CAB International) 
(Ollivier and Sellier, Ann. Genet. Sel. Anim., 14:481-544, (1982)). Be giving rise to a 
belt and be which in the homozygous form leads to the absence of a belt. The 
heterozygous animal Be/be carries a belt but in this genotype the belt is generally 
narrower in character. 

25 

To identify the actual genetic basis of the belted and non belted phenotype studies were 
carried out using animals from a Pietrain x (Pietrain x Hampshire) cross. The Pietrain is 
be/be while the Hampshire is Be/Be. Thus the Fl generation all have the genotype 
Be/be. Further crossing of the Fl back to the Pietrain (6e/E>e)leads to the segregation of 
3 0 the Be allele between offspring, giving rise to Be/be animals showing belts and be/be 
non belted offspring. A correlation was then established between the inheritance of the 
belted condition and certain microsatellite markers within these pedigrees. This work 



15 

surprisingly identified the actual gene involved as the KIT gene also described above as 
involved in dominant white. Further analysis showed correlation of the phenotype in 
this pedigree with a polymorphism at KIT nucleotide 2678 with a C or T occurring at 
this position. The presence of a C creates a restriction site for Aci I which is absent 
5 when a T is present. Based upon these unexpected findings a number of approaches 
can be taken to the determination of the genotype for an animal at the belt locus using 
either single nucleotide polymorphisms or linked markers including microsatellites or 
other single nucleotide polymorphisms. Thus animals can be genotyped by a number 
of approaches to determine their genetic status for this particular overt characteristic. 

10 

Summary of the invention 

According to the present invention there is provided a method for differentiating 
animals and animal products on the basis of breed origin, for determining or testing the 
15 breed origin of an animal product or for validating an animal product, wherein the 
method comprises the steps of: (i) providing a sample of the animal product; and (ii) 
analyzing the allele(s) of one or more breed determinant genes present in the sample. 

As explained above, the breed determinant is an overt phenotypic trait. As used herein, 

2 0 an overt phenotypic trait is one which can be visually recognized. 

Differentiation of animal products on the basis of breed origin involves the partition of 
members of a class of different animal products into a number of different products 
sharing the same breed origin. It does not necessarily imply identification of the nature 
25 of the breed source. Animal product differentiation on this kind of basis may be 
sufficient where the consistency of source of animal products must be monitored (but its 
actual breed provenance is riot important). 

In contrast, determination of the breed origin of an animal product implies identification 

3 0 of the breed source, while testing the breed origin implies analysis sufficient to 

determine whether a breed source other than that desired has been used (without 
necessarily identifying such other breed sources in cases where they are indicated). 



16 

Validating an animal product implies confirming that it meets stipulated specifications 
as to breed provenance. Such validation may involve differentiation, determination 
and/or testing, depending on the circumstances under which the analysis is performed 
5 and the nature and extent of ancillary data which may be available. 

The sample for use in the invention may be in any convenient form. In many cases, the 
sample will be a sample of a food (e.g. meat product). For most applications, the 
sample is pre-treated (e.g. extracted, purified and/or fractionated) in such a way so as to 

10 make the alleles of a breed determinant gene or genes available for analysis (either at 
the level of nucleic acids (such as RNA or DNA) and/or proteins). The sample is 
preferably a nucleic acid sample, in which case the analysing step (ii) comprises DNA 
or RNA analysis. Alternatively, the sample may be a protein sample (where the nature 
of the protein reflects a breed determinant allele), in which case the analysing step (ii) 

15 comprises protein analysis. 

The breed determinant of the invention may be a monogenic or polygenic trait. 
Monogenic traits are preferred, since the genes conferring such traits are relatively 
easily identified and analyzed. However, in some cases it may be useful to analyze the 

2 0 alleles of polygenic traits (i.e. traits which are controlled by a plurality of genes), since 

the underlying allele polymorphism is often greater in such cases (so increasing the 
potential for breed differentiation). 

Typically, overt phenotypic traits are those traits which have been used as the basis for 
25 artificial selection during the breeding programme. The overt phenotypic trait is 
preferably a behavioural or morphological, physiological or behavioural trait. 

The overt phenotypic trait may vary qualitatively or quantitatively between breeds. 
Preferred are traits which vary qualitatively between breeds, since such traits are often 

3 0 reflected by qualitative differences in the alleles of the corresponding breed determinant 

gene(s). In such cases, analysis yields relatively robust positive-negative results, which 
are easily interpreted and compared between testing stations/laboratories. 



The breed determinant gene analysed in step (ii) may be any suitable breed determinant 
gene. Such genes may be identified and analysed by methods well known in the art 
using routine trial and error. Preferably, they are selected from any of a coat colour, 
5 pattern, texture, density or length gene; an ear aspect gene; a double muscling gene; a 
horn morphology gene; a tusk morphology gene; an eye colour gene; a plumage gene; a 
beak colour/morphology gene; a vocalization (e.g. barking) gene; a comb or watde 
gene; and/or a gene controlling display behaviour. 

10 In preferred embodiments, the breed determinant gene is the KIT or ctMSHR coat 
colour gene (for example, the pig KIT and/or aMSHR gene). 

The analysis step (ii) may comprise any of a wide range of known nucleic acid/protein 
analytical techniques. The nature of the analytical technique selected is not critical to 
15 the practice of the invention, and those skilled in the art can readily determine the 
appropriate technique according to the circumstances in which the analysis is to be 
conducted and the type of data required. 

Preferably, the analysis step (ii) comprises selectively amplifying a specific fragment of 

2 0 nucleic acid (e.g. by PCR), testing for the presence of one or more restriction 

endonuclease sites within the breed determinant gene(s) (e.g. restriction fragment length 
polymorphism (RJFLP) analysis), determining the nucleotide sequence of all or a portion 
of the £reed determinant gene(s), probing the nucleic acid sample with an allele-specific 
DNA or RNA probe, or carrying out one or more PCR amplification cycles of the 
25 nucleic acid sample using at least one pair of suitable primers and then carrying out 
RFLP analysis on the amplified nucleic acid so obtained. 

Alternatively, the analysis step (ii) comprises probing the protein sample with an 
antibody (e.g. a monoclonal antibody) specific for an allele-specific epitope, 

3 0 electrophoretic analysis, chromatographic analysis, amino-acid sequence analysis, 

proteolytic cleavage analysis or epitope mapping. For example the E? allele might be 
distinguished by any method capable of detecting an alteration in the size of the 



encoded protein; 
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In particularly preferred embodiments, the analysis step (ii) comprises determining the 
nucleotide sequence of the KIT and/or aMSHR gene or the amino acid sequence of the 
5 KIT and/or aMSHR protein. Here, the analysis may comprise establishing the presence 
or absence of at least one mutation in the KIT and/or aMSHR gene. Any method for 
identifying the presence of the specific sequence change may be used, including for 
example single-strand conformation polymorphism (SSGP) analysis, ligase chain 
reaction, mutagenically separated PCR, RFLP analysis, heteroduplex analysis, 
10 denaturing gradient gel electrophoresis, temperature gradient electrophoresis, DNA 
sequence analysis and non-gel based systems such as TaqMan™ (Perkin-Elmer). 

In the TaqMan™ system, oligonucleotide PCR primers are designed that flank the 
mutation in question and allow PCR amplification of the region. A third 
15 oligonucleotide probe is then designed to hybridize to the region containing the base 
subject to change between different alleles of the gene. This probe is labelled with 
fluorescent dyes at both the 5 f and 3' ends. These dyes are chosen such that while in this 
proximity to each other the flourescence of one of them is quenched by the other and 
cannot be detected. Extension by Taq DNA polymerase from the PCR primer 

2 0 positioned 5 f on the template relative to the probe leads to the cleavage of the dye 

attached to the 5' end of the annealed probe through the 5 f nuclease activity of the Taq 
DNA polymerase. This removes the quenching effect allowing detection of the 
florescence from the dye at the 3 1 end of the probe. The discrimination between 
different DNA sequences arises through the fact that if the hybridization of the probe to 
25 the template molecule is not complete (i.e. there is mismatch of some form), then 
cleavage of the dye does not take place. Thus only if the nucleotide sequence of the 
oligonucleotide probe is completely complementary to the template molecule to which 
it is bound will quenching be removed. A reaction mix can contain two different probe 
sequences each designed against different alleles that might be present thus allowing the 

3 0 detection of both alleles in one reaction. 

Although the TaqMan™ system is currently capable of distinguishing only two alleles, 



labelled probe primer sets could be developed in which the probes for certain target 
allele(s) are labelled with a different fluorescent dye from non target alleles. For 
example, if one wished to confirm that a group of Duroc breed pigs carried only allele e 
one could have a probe present capable of detecting this allele labelled with one 
5 fluorescent dye and probes capable of detecting all the other alleles labelled with the 
second dye. Thus one would detect the presence of any non Duroc type alleles at this 
locus. Such probe sets could be designed and labelled according to the needs of the 
experiment. 

10 The analysis step (ii) may further comprise determining the association between one or 
more microsatellite marker alleles linked to the KIT and/or aMSHR gene and to 
particular alleles of the KIT and/or aMSHR gene. 

Alternatively, the analysis step (ii) may be based on the identification of microsatellite 
15 markers present in the nucleic acid sample. 

The analysis step (ii) preferably comprises: (a) determining the association between one 
or more microsatellite marker alleles linked to the KIT and/or aMSHR gene and to 
particular alleles of the KIT and/or aMSHR gene; determining which microsatellite 
2 0 marker allele or alleles are present in the nucleic acid sample. 

The analysis step (ii) preferably further comprises the step of determining the genotype 
of at least one additional locus, for example an additional breed determinant (e.g. coat 
colour) locus. Particularly preferred as an additional locus is the ATT gene locus (e.g. the 
2 5 pig KIT gene locus). 

The analysis step (ii) preferably comprises PCR using at least one pair of suitable 
primers. In the case where the gene is the pig aMSHR gene, the at least one pair of 
suitable primers is: 

30 

aMSHR Forward Primer 1: (5 -TGT AAA ACG ACG GCC AGT RGT GCC TGG 
AGG TGT-3 1 ) 



20 

cxMSHR Reverse Primer 5: (5 -CGC CCA GAT GGC CGC GAT GGA CCG-3'); or 
ccMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3') 
cxMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); or 
aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3') 
5 aMSHR Reverse Primer 3: (5 -GGG GCA GAG GAC GAC GAG GGA GAG-3') 

The analysis step (ji) may also comprise restriction fragment length polymorphism 
(RFLP) analysis, for example involving digesting the pig nucleic acid with one or more 
of the restriction enzymes BstUl, Hhal and/or BspHl In cases where the gene is the pig 

10 aMSHR gene, this analysis may involve identification of a polymorphism at any of the 
nucleotide positions shown to be polymorphic including 283, 305, 363, 370, 491, 727, 
729, 1162 or between nucleotide positions 60 and 70 or between nucleotide 
positonslOOS and lOlOof the pig aMSHR gene. 

15 The analysis step (ii) may involve carrying out one or more PCR amplification cycles of 
the nucleic acid sample using at least one pair of suitable primers and then carrying out 
RFLP analysis on the amplified nucleic acid so obtained to determine the KIT or 
aMSHR genotype of the pig. Here, when the gene is the pig aMSHR gene the at least 
one pair of suitable primers is as defined above. 

20 

The animal product preferably comprises or consists of meat (e.g. processed and/or 
canned meat), egg, egg swab or washing, semen, blood, serum, sputum, wool, biopsy 
sample or leather. It may comprise genomic DNA, RNA or mitochondrial DNA. 

2 5 The animal is preferably a mammal (e.g. pig, cattle, dog, cat, horse, sheep, rodent or 

rabbit), fish (e.g. salmon or trout) or bird (e.g. chicken or turkey). 

The invention may be used with extremely small samples and can be used to screen 
large numbers of samples quickly and inexpensively. The invention may be adapted to 

3 0 yield absolute results, and quantification is not essential. Moreover, only small 

fragments of nucleic acid are required, and the same tests can be used on the majority of 
animal products. 



• A pplications • * 

The invention finds application in a number of areas. For example, certain breeds are 
considered to yield meat of higher eating quality, and a number of retailers now market 
5 products which claim to be derived from specific or traditional breeds (for example, 
Wild Boar crosses). The invention enables consumer organisations to validate these 
claims and also permits retailers to monitor the quality of the products with which they 
are being supplied (i.e. perform product validation). The invention finds particular 
application in validation studies carried out and used by retailers to support consumer 
10 confidence, since the linkage between a genetic marker and an overt physical feature is 
more readily grasped by the lay person than the concept of breed specific markers. This 
makes the use of such breed determinants attractive and also offers marketing 
opportunities for retailers to underpin validation schemes. 

15 There are also a number of reports of breed influences on the quality of hams produced 
by various meat processing techniques. For example, in one report hams from three 
different pig breeds were reliably classified on the basis of sensory descriptors of 
marbling, saltiness and dry cure flavour. The breed identification processes of the 
invention enables producers to validate raw materials as part of quality control. 

20 

The ability to enforce and validate raw material source uniformity also yields improved 
process control, lower costs and greater product consistency, since it has now been 
found .that heterogeneity in chemical composition of products from different breeds is 
an important factor in flavour profile variation and there may also be differences in the 

2 5 functionality of other meat components between breeds. 

The invention also finds utility in the maintenance of stock purity by animal (e.g. pig) 
breeders. The small size of traditional breed populations means that the maintenance of 
a gene pool of sufficient size to avoid the effects of inbreeding requires the importation 

3 0 and movement of stock between separate populations. A risk of genetic contamination 

is associated with such movements, and the invention may be used to reduce or 
eliminate these risks. The maintenance of biodiversity and the rare breeds providing the 
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reservoir for this diversity provides an increasing need for breed identifiers. There is for 
example a problem for breeders of the British Saddleback. Certain bloodlines of this 
breed cany a higher frequency of the be allele of the belt locus which can result in the 
production of belt-less animals which do not reach the required breed standard and 
5 decrease the value not only of that individual animal but of the whole litter. The ability 
to select against this allele when new bloodlines are introduced to an existing 
population would enable breeders to increase the genetic diversity with out the risk of 
lowering the relative standard of the particular population to that of the breed in general. 

10 The invention may also be used as part of a breeding programme to confirm particular 
crosses. This may be of enormous value in the establishment of pyramid breeding 
schemes. Particular breed characteristics such as coat colour, body shape and ear aspect 
are often altered in such crosses, yet there is a need to be able to confirm the presence of 
genetics of the desired parents. 

15 

Such visible breed characteristics for the visible confirmation of crosses are also absent 
in the use of artificial insemination, where semen may be supplied from pigs in distant 
geographical locations. 

2 0 In addition, the skilled person will appreciate that based on the information described 

herein, it is possible to provide tests for determining pig genotype, with respect to coat 
colour. Thus, the present invention also provides a method of determining the coat 
colour,genotype of a pig which comprises: 

25 (i) obtaining a sample of pig nucleic acid; and 

(ii) analysing the nucleic acid obtained in (i) to determine which allele or 
alleles of the aMSHR gene are present. 

3 0 In one embodiment of this aspect of the invention the determination in step (ii) is 

carried out by determining the nucleotide sequence of the aMSHR gene and, in 
particular, is based on determining which missense, insertion or deletion mutation is 



present in the coding region of the gene. 

In another embodiment one could first determine the association between 
microsatellite or other linked marker alleles linked to the aMSHR gene and particular 
5 alleles of the aMSHR gene. Thus, the determination in step (ii) would be based on 
identification of microsatellite marker alleles present in the nucleic acid sample. 

In a further aspect, therefore, the present invention provides a method of determining 
the coat colour genotype of a pig which comprises: 

10 

(i) determining the association between one or more microsatellite or 
other linked marker alleles linked to the aMSHR gene and particular 
alleles of the aMSHR gene; 

15 (ii) obtaining a sample of pig nucleic acid; and 

(iii) analysing the nucleic acid obtained in (ii) to determine which 
microsatellite or other linked marker allele or alleles are present. 

20 The determination of the alleles at the extension locus will indicate the background 
colour of the animal and in some cases the pattern of mixed colouration, i.e. spotting, 
but will not necessarily determine the coat colour of resulting progeny. This will be 
dependent on the genotype at other loci such as the dominant white locus, /. The 
genotype at the / locus can be determined separately as described in WO-A-97/05278. 

25 

Thus, suitably, the methods as described above may further comprise the step: 
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(iii)/(iv) determining the genotype of additional coat colour loci. 

An example of such an additional coat colour locus is the belt locus. 

In a preferred method PCR is carried out using primers that amplify a region of the KIT 
gene containing nucleotide 2678. An example of a suitable pair of primers is: 

* ■ 

forward primer 

LA93 5' - GAGCAGCCCCTACCCCGGAATGCCAGTTGA -3* 
and the reverse primer 

KIT56 5' - CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG - 
3'- 

The method of analysis enables determination of the presence of a C or T at position 
2678. Suitably, the restriction enzyme Acil can be used since the presence of a C 
creates a restriction site which is absent when a T is present. Similar examinations 
within a pedigree of will allow the determination of the genotype of offspring. 

Thus, in additional aspect, the present invention provides A method of determining the 
coat colour genotype of a pig which comprises: 

(i) obtaining a sample of pig nucleic acid: and 

(ii) analysing the nucleic acid obtained in (i) to determine whether the KIT 
gene carries any polymorphism associated with Belt genotype. 

Preferably, the method comprises RFLP analysis which is suitably carried out on a 
sample of pig genomic DNA which has been amplified using PCR and a pair of 
suitable primers. 

Preferred methods for identifying the presence of the specific sequence change are 
described above in relation to breed determinants. 
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- * Brief description of the Figures 

Figure 1 Partial nucleotide sequence (a) and the derived amino acid sequence (b) of 
the porcine aMSH-R gene as determined from a number of pig breeds. Position 
numbers for the nucleotide sequence are based upon nucleotide 1 being the A of the 
ATG initiation codon. Numbers of the amino acids are in accordance with the bovine 
BDF3 sequence (Vanetti et at. FEBS Lett. 348: 268-272 (1995)) to allow comparison. 

Figure 2 : Agarose gel electrophoresis of DNA fragments obtained by digestion of 
DNA fragments amplified from the porcine aMSH-R gene with BstUl or Hhal. Lanes 
labelled M contain DNA markers of 50, 150, 300,500, 750, lOOObp. The other samples 
were derived from: 

1. Pietrain 

2. Pietrain 

3. Large White 

4. Large White 

5. Large White 

6. Duroc 

7. Duroc 

8. Hampshire 

9. Meishan 

10. Berkshire : 

11. Berkshire 

Figure 3 : Agarose gel electrophoresis of DNA fragments obtained by digestion of 
DNA fragments amplified from the porcine aMSH-R gene with BstUl (lanes labelled 
B) or Hhal (lanes labelled H). Lanes labelled M contain DNA markers of 50, 150, 
300,500, 750, lOOObp. The other samples were derived from: 

1. Retailer 1. Skin 

2. Retailer 1. Fat 

3. Retailer 1. Muscle 

4. Retailer 2. Fat 



5. Retailer 2. Muscle 
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Figure 4 : Electropherogram (4% agarose) showing RT-PCR products of KIT exon 16- 
19 with the primers KIT1F and KIT7R. The samples 1-3 and 4-6 are Swedish Large 
White and Hampshire pigs respectively. The size difference between the 424 and 301 
bp fragments is due to lack of exon 17 in the latter fraction. The two upper bands of the 
Yorkshire pigs were interpreted as heteroduplexes (HD). 

Figure 5 : A 48 bp sequence is shown comprising 21 bp of KIT exon 17 and 27 bp of 
KIT intron 17. The position of the intron/exon border is marked with a vertical line and 
the splice site mutation (ntl G ~* A ) indicated with a vertical arrow. Identical bases in 
alleles f and / are marked with a dot. 

Figure 6 : Malll PCR RFLP test used to detect the presence of a splice site mutation in 
intron 17 of the KIT gene. Figure 6A shows the position of two Nlalll recognition sites 
within the PCR product amplified using primer pair KIT21 and KIT35. All distances 
are given in base pairs. Figure 6B shows the size of fragments which result following 
Malll digestion of either normal KIT or splice mutant KIT. Figure 6C illustrates use of 
the PCR RFLP test. Lane 1 shows the KIT21/KIT35 amplified fragment undigested. 
Digestion was performed on PCR products amplified from, in Lane 2: a clone which 
contains the splice site mutation; Lane 3: a clone which contains the normal splice site 
sequence; Lane 4: genomic DNA from a coloured pig; Lane 5: genomic DNA from a 
white pig. Fragment sizes are given in base pairs. 

Figure 7 : Comparison of the ratio of normal to splice mutant KIT for three classes of 
genotype. 

Figure 8 : Comparison of the ratio of normal to splice mutant KIT for two breeds of pig. 

Figure 9 : SSCP analysis of the KIT gene in Swedish Landrace (lanes 1-8) and Wild 
Boar (lanes 9 & 10) breeds. The two polymorphic bands are indicated. 
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Figure 10 : Nucleotide sequence of the porcine KIT cDNA from an animal of the 
Hampshire breed. The sequence is numbered with the first nucleotide of the N terminal 
methionine codon taken as 1 . 

Figure 11 : Polyacrylamide gel electrophoresis of PCR-RFLP analysis of KIT gene at 
polymorphic nucleotide 2678 in a number of animals. Lanes: 1 & 2, Hampshire Wild 
Boar respectively, both homozygous for the C at position 2678. Lanes 3-7 and 9 & 10, 
unrelated Large White sows all homozygous for T at position 2678. Lane 1 1, a Pietrain, 
homozygous for T at this position and lane 8 a Large White sow heterozygous for C 
and T. Lane 12 contains undigested PCR product and lane M DNA size standards. 

Figure 12 

Nucleotide sequence of the 3' end of the porcine aMSHR coding region and adjacent 
3' untranslated region. The TGA stop codon is highlighted in bold, the primer binding 
sites for EPIG14 is shown in italics. Numbering is based on the system used in figure 
la in which nucleotide 1 is the A of the ATG initiation codon of the Wild Boar 
sequence. Bases in common with the European Wild Boar are marked with a dash. 
Missing bases are marked with a :. 

Examples 

Example 1 : Determination of the sequence of the aMSHR gene 
The DNA sequence of the porcine aMSHR gene was determined through the DNA 
sequencing of a combination of PCR products and cloned portions of porcine DNA. 
♦ 

Preparation of template DNA for PCR 

DNA can be prepared from any source of tissue containing cell nuclei, for example 
white blood cells, hair follicles, ear notches and muscle. The procedure here relates to 
blood cell preparations; other tissues can be processed similarly by directly 
suspending material in K buffer and then proceeding from the same stage of the blood 
procedure. The method outlined here produces a cell lysate containing crude DNA 
which is suitable for PCR amplification. However, any method for preparing 
purified, or crude, DNA should be equally effective. 

Blood was collected in 50mM EDTA pH 8.0 to prevent coagulation. 50^il of blood 



28 

was dispensed into a small microcentrifuge tube (0.5ml Eppendorf or equivalent). 
450|il of TE buffer was added to lyse the red blood cells (haem groups inhibit PCR) 
and the mix vortexed for 2 seconds. The intact white and residual red blood cells 
were then centrifuged for 12 seconds at 13,000 g in a microcentrifuge. The 
supernatant was removed by gentle aspiration using a low pressure vacuum pump 
system. A further 450^il of TE buffer was then added to lyse the remaining red blood 
cells and the white blood cells collected by centrifugation as before. If any redness 
remained in the pellet, this process was repeated until the pellet was white. After 
removal of the last drop of supernatant from the pelleted white blood cells, lOOjil of K 
buffer containing proteinase K was added and the mixture incubated at 55 degrees C 
for 2 hours. The mixture was then heated to 95-100 degrees C for 8 minutes and the 
DNA lysates stored at -20 degrees C until needed. 

Reagents 

T.E. Buffer: lOmM TRIS-HC1 pH8.0 

ImMEDTA 

K Buffer: SOmMKCl 

10mMTRIS-HClpH8.3 
2.5mMMgC12 
, 0.5%Tween20 

PCR to produce DNA sequencing template 

The otMSHR gene was amplified for sequence analysis using three primer pairs. 

Primers MSHR Forward Primer 1 : (5'-TGT AAA ACG ACG GCC AGT RGT GCC 
TGG AGG TGT CCA T-3 1 ); and 

MSHR Forward Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3') 



amplify a 428 bp fragment from the 5 1 half of the gene. 
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Primers MSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG 
TGC-3'); 

and aMSHR Reverse Primer 2: (5-GGA AGG CGT AGA TGA GGG GGT CCA-3') 

amplify a 405 bp fragment the 3' half of the gene. 

As these two fragments are non-overlapping a third primer pair 

aMSHR Forward Primer 4 (5'-TGC GCT ACC ACA GCA TCG TGA CCC TGC-3*); 

and 

aMSHR Reverse Primer 4 (5-GTA GTA GGC GAT GAA GAG CGT GCT-3') 

were used to amplify a 98 bp fragment which spans the 50 bp gap. PCR was carried 
out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume of 20 nl 
containing 25 ng genomic DNA, 1.0 mM MgC12, 50 mM KC1, 10 mM Tris-HCl, pH 
8.3, 200 (M dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 pmol of both 
forward and reverse primer. To activate AmpliTaq Gold, initial heat denaturation was 
carried out at 94 degrees C for 10 minutes followed by 32 cycles each consisting of 45 
sec at 94 degrees C, 45 sec at 53 degrees C and 45 sec at 72 degrees C. The final 
extension lasted for 7 min at 72 degrees C. PCR products were cloned into vector 
pUC18 using the SureClone ligation kit (Pharmacia). 

Preparation of plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
midi kit 50 (Genomed) and the resulting DNA diluted to 1 50 ng/|il. 

Sequencing of plasmid DNA 

Cloned plasmid inserts were sequenced using dye primer chemistry. Each cycling 
reaction was prepared with template and ready reaction mix containing fluorescently 
labelled Ml 3 forward or reverse primer as described in the ABI Prism protocol P/N 
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4021 13 (Perkin Elmer). Cycling and sample pooling was performed using a Catalyst 
800 Molecular Biology Workstation (ABI) following the instruments user manual 
(Document number 903877, Perkin Elmer). The resulting extension products were 
purified, loaded and analysed using the 377 ABI Prism DNA sequencer as described 
by the instrument protocol (Perkin Elmer protocol P/N 402078). 

Dve Terminator Sequencing of PCR products 

Dye terminator DNA sequencing requires purification of PCR product free from 
excess dNTPs and residual primers. This was achieved by passage of the template 
DNA through QiaQuick spin columns (Qiagen) before the purified DNA was diluted 
to 15 ng/(il. Dye terminator cycle sequencing was performed using AmpliTaq DNA 
polymerase FS in accordance with the ABI Prism protocol P/N 402078 (Perkin 
Elmer). Cycle sequencing reactions were performed in a total reaction volume of 10 
111. This comprised 1 .6 pmole of either the forward or reverse primer used to amplify 
the target fragment from genomic DNA, 20 ng of purified template DNA and 
terminator ready reaction mix (Perkin Elmer) which contains each of four dye 
terminators, dNTPs, Tris-HCl (pH 9.0), MgCl 2 , thermal stable pyrophosphate and 
AmpliTaq DNA polymerase FS. Cycle sequencing was performed with a GeneAmp 
9600 machine (Perkin Elmer) over 25 cycles, each consisting of 10 sec at 96 degrees 
C, 5 sec at 50 degrees C and 4 min at 60 degrees C. Extension products were purified 
for gel separation-using eihanoi precipitation, loaded and run on a 377 ABI Prism 
DNA sequencer as described by the instrument protocol (Perkin Elmer protocol P/N 
402078). 

Results 

The partial coding region DNA sequence of the porcine aMSHR gene sequence from 
a number of pig breeds is given in figure la combined with sequence determined in 
example 22. The derived amino acid sequence is shown in figure lb. 

Example 2: PCR-RFLP based discrimination of alleles at the E locus 

DNA preparation for PCR 
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• As in example 1: 1 . 
PCR 

Reactions were set up in a 20^1 reaction volume in thin walled 0.25ml tubes (Perkin 
Elmer) with the following components: 

20jil reaction volume: 
2jal template DNA 
1.5mMMgCl 2 
200jiM each dNTP, 

3 pM each of forward and reverse primers 
0.5 U AmpliTaq Gold (Perkin Elmer) 

MSHR Forward primer 3 sequence: 5' GCA CAT CGC CCG GCT CCA CAA GAC 

y 

MSHR Reverse primer 3 sequence: 5* GGG GCA GAG GAC GAC GAG GGA GAG 

y 

The reaction tubes were placed on a Perkin Elmer 9600 thermal cycler preheated to 94 
degrees C and PCR carried out according to the regime below: - 
~ Initial denaturation step of 94 degrees C for 1 0 min. 
# 33 cycles: 94 degrees C - 45 sees 
53 degrees C - 45 sees 
72 degrees C - 45 sees 
The last cycle is followed by a final elongation of 72 degrees C for 7 min. Samples 
are stored at 4 degrees C until required. 

Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 148bp in length. To test for polymorphism in the 
amplified products the reaction is split into two aliquots of 10|j.l each of which is 
digested with flfcal (GIBCO-BRL) or BstUl (New England Biolabs). The reactions 
are set up and incubated as below: 
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BstW digest tfAal digest 

lOul amplified DN A lOul amplified DNA 

2.5n BstUl 2.5\iHha\ 

60 degrees C 60 minutes 0.5ul lOx React 2 buffer (GIBCO-BRL) 

37 degrees C 60 minutes 

Following digestion, 2\x\ of loading dye is added to each reaction (lOOmM Tris 
pH8.0, lOOmM Boric Acid, ImM EDTA, 50% (v/v) glycerol, 0.02% w/v Orange G) 
and the mixes loaded on a 4% agarose gel (3% NuSieve/1% Seakem, FMC 
Byproducts) in 0.5x TBE (44.5mM Tris pH8,0, 44.5 mM boric acid and 0.5mM 
EDTA) and electrophoresed for 1 hour at 150v. 

Products are visualised by ethidium bromide staining. 

Results 

BstUl and Hhal digestion each result in bands of 61 and 87bp. The relationship of 
digestion to the possible allele is as shown in the table below: 

Relationship of restriction digest profiles to individual alleles at the E locus 

Allele # Digestion with BstUl Digestion with Hhal 

Yes Yes 
ET No Yes 

e No No 



If the uncut alleles are designated as allele 1 and the alleles digesting with each 
enzyme as allele 2 the various genotypes will be as shown in the table below: 
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. Actual E genotvpeis and associated scores 



Genotype 


BsiUl 


Hhal 


ET/ET 


1/1 


2/2 


E m /E p or ET/E*- 


1/2 


2/2 


ET/e 


1/1 


1/2 


ET/Ef or Ef/E* or E*/E* 


2/2 


2/2 


EF/e or E*/e 


1/2 


1/2 


e/e 


1/1 


1/1 



Note: The results for animals carrying the allele will be the same as those carrying 
EP. 

Samples were prepared from a number of pigs and tested according to the above 
protocol. The results are shown in the table below and figure 2 illustrates the patterns 
seen upon electrophoresis. 

E genotypes determined for a ranee of breeds using the BsiUllHhal digestion system 
Breed No Tested Genotype (see note 1) aMSHR type 









- BsfyJl - 


"Hhal 


Hampshire 


9 


£*/£* 


2/2 


2/2 


Large White 


4 


Ef/E? 


212 


2/2 


Landrace 


1 


EF/Ef 


212 


212 


Pietrain 


3 


ET/EF 


212 


212 


Berkshire 


2 


EF/EP 


2/2 


212 


Bazna 


4 


Ef/ET 


1/2 


2/2 






ET/Ef" 


1/1 


2/2 


Duroc 


4 


e/e 


1/1 


1/1 


Meishan 


3 


E m /E m 


1/1 


2/2 



Note 1. Thegenotype cannot be distinguished from E* or in this particular test. 
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As can be seen from the results above the genotypes determined fit with those 
expected from the sequencing data given in figure 1 a for Hampshire, Large White, 
Meishan and Duroc. The additional breeds typed here show the genotypes expected 
from their phenotype and descriptions in published literature (Ollivier and Sellier, 
Ann. Genet. Sel. Anim., 14: 481-544, (1982)). The Pietrain is a white breed with 
black patches of varying extent and has long been considered to be EF (in agreement 
with the result here). The Berkshire, originally a spotted breed, is now a mainly black 
animal with white 'socks' again generally considered to be EF as was found here. The 
Landrace is a white animal due to it carrying the dominant white allele at the / locus, 
however its genotype at the E locus has been shown to be EF from classical breeding 
studies. Once again this is in agreement with the results obtained here. The Bazna is 
a Romanian breed having black base colour with a white belt. It was developed from 
the Berkshire and Mangalitza, a Hungarian breed with a number of colour variations 
including black (Porter, Pigs, a handbook to breeds of the world, publ: Helm 
Information, ISBN 1-873403-17-8 (1993)). The ancestory.of the Bazna being based 
upon a black breed potentially carrying a similar allele to the Meishan, and the 
Berkshire carrying EF, is in agreement with the alleles found to be present in the breed 
in this work. 

Example 3: Validation of source breeds of retail meats 
DNA preparation 

DNA was prepared from different parts of pork chops from two separate retailers. 
The DNA was prepared from skin (1 retailer only), fat and muscle using the Promega 
Wizard Genomic DNA preparation kit according to the manufacturers instructions. 
Approximately 4mm 3 of each tissue was cut into small fragments for the extraction. 



PCR and Restriction Digest Analysis 

This was carried out exactly as in example 2. 
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Results • 

The results are shown in figure 3. It can be seen that DNA extracted from a range of 
tissue types can be utilised for this DNA based test with results being obtained here 
for muscle, fat and skin. The genotype of the pig with regard to the (MSHR gene can 
then be determined. In this case the material from both retailers was derived from an 
animal of test type BstUl 1/2 and Hha\ 1/2 using the nomenclature as in example 2. 
This translates into genotype EF/e or E*/e. Based on our current knowledge of the 
distribution of the alleles in commercial pig breeds the conclusion can be drawn that 
both source animals contain genetic material derived from the Duroc. 

Example 4: Validation of source breeds of processed meat samples 

Method 

DNA was prepared from heat treated meat samples according to the method of Meyer 
et al. (Journal of AOAC International, 78 1542-1551). Meat samples were minced 
with a scalpel and 0.3g transferred to a sterile 1.5ml eppendorf tube containing 430jil 
of extraction buffer (lOmM Tris-HCl pH 8.0, 150mM NaCl, 2mM EDTA, and 1% 
w/v sodium dodecyl sulphate). Fifty microlitres of 5M guanidine hydrochloride and 
20\i\ of 20mg/ml proteinase K (Boehringer) were added and mixed by inversion 
followed by incubation at 57°C for 3h. After digestion samples were centrifuged foi 
TO iniri at 13,000 x g, and~450nl of the aqueous phase added to 1ml Wizard DNA 
purification resin (Promega). The mixture was mixed by gentle inversion and 
following the Wizard DNA clean-up procedure carried out according to the 
manufacturers instructions the purified DNA was eluted with 50^il of 70°C water. 1 \x\ 
of a 1 :10 dilution was then used as template in a 10|il PCR. 

PCR was carried out as described in the previous example. 

Results 

Meat samples from a Large White based line and a Duroc based line heated at 80°C 
for 30 mins could be differentiated on the basis of their genotype at the E locus with 
the Large White samples giving a pattern characteristic of the EF allele and the Duroc 
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samples a pattern characteristic of the e allele. 
Example 5: Validation of source breeds of semen 

Genomic DNA was isolated from porcine semen. 1ml of semen was centrifuged for 2 
min at 13,500 x g and the supernatant removed. 1 ml of 2xSSC was added and the 
mix vortexed to resuspend the sperm. The mix was then centrifuged as before and the 
supernatant removed. 400|il of 0.2M NaOAc pH 7.0 was added and the mix vortexed 
followed by the addition of 34nl of B-mercaptoethanol. The mixture was incubated at 
40°C for 30 min followed by the addition of lOOfil of 10% w/v sodium dodecyl 
sulphate and 50^il of 15 mg/ml Proteinase K (Boehringer) and further incubation at 
40°C for 3 hours. 500fil phenol equilibrated with Tris-HCl pH 8.0 was added and the 
mix vortexed twice followed by centrifugation at 13,500 x g for 4 min. 400|il of the 
aqueous phase was removed and 800^1 of ethanol added. DNA was allowed to 
precipitate for 5 min at room temperature followed by centrifugation at 13,500 x g for 
5 min. The pellet was washed with 800(il 70% ethanol v/v and air dried followed by 
resuspension in 200fil of Wizard DNA resuspension buffer (Promega). 1 \xl of a 1/10 
dilution was used in a \Q\i\ PCR 

PCR was carried out as described in example 2. 

Results ^ " "* 

Semen form a Hampshire based line and a Duroc based line could be differentiated on 
the basis of their genotype at the E locus with the Hampshire samples giving a pattern 
characteristic of the £* allele and the Duroc samples a pattern characteristic of the e 
allele. 

Example 6: Discrimination of allele E* from alleles 
DNA preparation 

DNA was prepared as described in example 1. 



PCR 
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. Reactions were set up in a 20jil reaction volume in thin walled 0.25ml tubes (Perkin 
Elmer) with the following components: 

lOjil reaction volume: 
2\i\ template DNA 
2.5 mM MgCl 2 
200jiM each dNTP, 

5pmol each of forward and reverse primers 
0.5 U AmpliTaq Gold (Perkin Elmer) 

Forward primer sequence: 5' CTG CCT GGC CGT GTC GGA CCT G 3' 
Reverse primer sequence: 5' CTG TGG TAG CGC AGC GCG TAG AAG 3 ' 

The reaction tubes were placed on a Strategene Robocycler and PCR carried out 
according to the regime below: - 
Initial denaturation step of 94°C for 1 0 min. 
30 cycles: 94°C - 60 sees 

61°C - 60 sees 

72°C - 60 sees 

The last cycle is followed by a final elongation of 72°C for 7 min. Samples are held at 
6°C until required. ---- - ~ ™- 

Restriction Enzvme Digestion and Electrophoresis 

The PCR amplification product is 228 in length. To test for polymorphism in the 
amplified products the reaction is digested with BspHl (New England Biolabs). The 
reactions are set up and incubated as below: 

BspHl digest 

lOjal amplified DN A 

1 ul 1 Ox React 2 (NEB New England Biolabs) 
0.5^1 deionised water 
5 units BstUl 



37°C60 minutes 
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Following digestion, 2\A of loading dye is added to the reaction (lOOmM Tris pH8.0, 
lOOmM Boric Acid, ImM EDTA, 50% (v/v) glycerol, 0.02% w/v Orange G) and the 
mix loaded on a 4% agarose gel (3% NuSieve/1% Seakem, FMC Bioproducts) in 0.5x 
TBE (44.5mM Tris pH8.0, 44.5 mM boric acid and 0.5mM EDTA) and 
electrophoresed for 1 hours at 150v. 

Products are visualised by ethidium bromide staining. 

Results 

BspHl digestion each result in bands of 124 and 104bp. The relationship of digestion 
to the possible allele is as shown below: 

Relationship of restriction digest profiles to individual alleles at the E locus 



Allele 


Digestion with BspHl 


Ef/Ef 


Yes 


Ef 


No 



Samples were prepared from a number of pigs and tested according to the above 
protoool and the results are shown below: 

£ genotypes determined for a range of breeds using the BspHl digestion system 



Breed 


No Tested 


Genotype (see note 1 ) 


Number 


Wild Boar x 


3 


Ef/E* 


3 


Swedish 








Landrace 








Large White 


4 


Ef/Ef 


4 


Landrace 


1 


Ef/Ef 


1 


Pietrain 


3 


Ef/Ef 


3 
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Note 1. Where the genotype E p is listed this cannot be distinguished from £* in this 
particular test. a 

Example 7: Discrimination of cattle products bv breed 

DNA was prepared from cattle muscle samples as described in example 4. PCR was 
then carried out in a lOOul reaction using the primer pair: 

5 '-TGAGGTAGGAGAGTTTTGGG-3 ' and 
5 '-TCGAAATTGAGGGGAAG ACC-3 ' 

as described in Kambadur et al. Genome Research 7: 910-915 (1997) at a 
concentration of 500nM with other reaction components being 2.5mM MgCh, 200uM 
dNTPs, 50mM KC1, lOmM Tris-HCl pH 8.3, 5 units AmpliTaq Gold (Perkin Elmer). 
1 ul of bovine genomic DNA was used as template. Denaturation Was carried out for 
12 min at 94°C followed by 30 cycles of 94°C for 1 min, ,55°C for 1 min, 72° 1.5 min 
followed by 5 min at 72°C. Following PCR 2.0ul of loading dye (44.5mM Tris pH 
8.0, 44.5mM boric acid, 0.5mM EDTA, 50%w/v glycerol, 0.02% w/v Orange G) was 
added to lOul of product and analysis carried out by electrophoresis on a 2% agarose 
gel prepared in 0.5x TBE buffer (44.5mM Tris pH 8.0, 44.5mM boric acid, 0.5mM 
EDTA) for 1 hour at 100V. 

* 

The remainder of the PCR was analysed for DNA sequencing using ABI dye 
terminator chemistry as described in example 1. 

Results 

Bovine mvostatin DNA polymorphisms and related phenotvpe 

Breed Phenotype nt position 941 length PCR product (bp) 

Belgian Blue Double muscle G 482 

Piedmontese normal A 493 

Holstein-Friesian Double muscle G 493 



. Example 8 
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RT-PCR of porcine KIT exon 16-19 

L mRNA purification from blood samples 

Fresh blood samples were collected in citrate tubes from coloured Hampshire pigs and 
Large White pigs. Leukocytes were isolated from 5 ml blood using Ficoll 100 
(Pharmacia Biotech). Isolation of mRNA from leukocytes was then carried out using 
the Quickprep Micro mRNA purification kit (Pharmacia Biotech). The mRNA was 
stored as a precipitate under ethanol at -70°C for up to one month before use in 
reverse transcriptase (RT)-PCR. 

ii. RT-PCR of KIT exon 16-19 

First-strand cDNA synthesis was accomplished using the First-Strand cDNA 
Synthesis kit (Pharmacia Biotech) so that -100 ng mRNA was randomly primed by 
0.1 \ig pd(N6) in a total volume of 1 5 pi. Two \i\ of the completed first cDNA strand 
reaction was then directly used per 12 \i\ PCR reaction by adding 10 ^1 PCR mix 
containing 10 pmol each of the mouse/human derived primers KIT IF and KIT7R (5'- 
TCR TAC ATA GAA AGA GAY GTG ACT C and 5'-AGC CTT CCT TGA TCA 
TCT TGT AG, respectively; Moller et al. 1996, supra), 1.2 \x\ 10 x PCR-buffer (10 
mM Tris-HCl, pH 8.3, 50 mM KCl) and 0.5 U of AmpliTaq polymerase (Perkin- 
Elmer) incubated with an equal amount Taqstart antibody (Clontech) at 25 °C for 5 
min tq achieve a hot start PCR. The reaction was covered with 20 fxl mineral oil and 
thermocycled in a Hybaid Touchdown machine (Hybaid) with 40 cycles at 94°C for 1 
min, 55-48 °C (touchdown one degree per cycle the first seven cycles and then 48°C 
in the remaining cycles) for 1 min and 72°C for 1 min. After PCR 2jil loading dye 
was added to each sample which were then loaded on 4% agarose gel 
(Nusieve/Seakem 3:1, FMC Bioproducts) and electrophoresed with 100V for 80 min. 
Products were visualised by ethidium bromide staining and UV-illumination. 

iii. Cloning and sequencing of RT-PCR-products 

The RT-PCR products representing KIT exon 16-19 were purified by extraction from 
2% agarose gels using the QIAEX gel extraction kit (QIAGEN) and cloned into the 
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pUC18 vector using the Sureclone ligation kit (Pharmacia Biotech), Plasmids were 
isolated using the QIAFilter plasmid Midi kit (QIAGEN). Cloned plasmid inserts 
were sequenced using dye primer chemistry. Each cycling reaction was prepared with 
plasmid template DNA and ready reaction mix containing fluorescently labelled Ml 3 
forward or reverse primer as described in the ABI Prism protocol P/N 4021 13 (Perkin 
Elmer). Cycling and sample pooling were performed using a Catalyst 800 Molecular 
Biology Workstation (ABI) following the instruments user manual (Document 
number 903877, Perkin Elmer). The resulting extension products were purified, 
loaded and analysed using the 377 ABI Prism sequencer as described by the 
instrument protocol P/N 402078 (Perkin Elmer). 

iy Results and discussion 

A 424 bp fragment including KIT cDNA exon 16-19 was amplified from all pigs. The 
Hampshire pigs did not show any additional products whereas the Large White pigs 
(eight tested) all showed a 301 bp truncated cDNA fragment (Fig 4). Sequence 
analysis revealed the 424 bp fragment was identical in the two breeds whereas the 
whole exon 17 (123 bp) was missing from the 301 bp fragment. Apparent differences 
between individuals regarding the relative amounts of these two products may have 
been caused either by different genotypes containing differing numbers of copies of 
the KIT gene sequence, individual differences in mRNA expression levels or random 
RT-PCR effects. . r . — : - 

The two upper fragments present in Large white pigs represent heteroduplexes 
between the 301 and 424 bp fragments (Fig. 2). This was shown by an experiment 
where these slow migrating fragments were generated by pooling homoduplexes of 
the 424 and 301 bp which were then heat denatured and cooled to 25°C. Moreover, 
cloning of the lower heteroduplex fraction of a Large White pig resulted in clones 
with insert length corresponding to either of the two homoduplexes. 
Example 9 



PCR Amplification and Sequencing of KIT Exon 17-Intron 17 (5* Splice Site) 
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L PCR to produce' DNA Sequencing Template 

A 175 bp region including the boundary between exonl7 and intronl7 of the KIT 
gene was amplified for sequence analysis using forward primer KIT21 (5 5 - GTA 
TTC ACA GAG ACT TGG CGG C -3') and reverse primer KIT35 (5' - AAA CCT 
GCA AGG AAA ATC CTT CAC GG - 3'). PCR was carried out on a DNA thermal 
cycler (Perkin Elmer 9600) in a total volume of 20 |il containing 25 ng genomic 
DNA, 1.0 mM MgCfe, 50 mM KC1, 10 mM Tris-HCl, pH 8.3, 200 ^iM dNTPs, 0.5 U 
AmpliTaq Gold (Perkin Elmer) and 10 pmol of both KIT21 and KIT35 primer. To 
activate AmpliTaq Gold, initial heat denaturation was carried out at 94°C for 10 
minutes followed by 32 cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 
45 sec at 72°C. The final extension lasted for 7 min at 72°C. PCR products were 
cloned into vector pUC 1 8 using the SureClone ligation kit (Pharmacia Biotech). 

iL Preparation of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
midi kit (Genomed) and the resulting DNA diluted to 150 ng/jal. 

iii. Sequencing of plasmid DNA 
DNA was sequenced as in example 8. 

iv. Results ~ 

A portjon of the DNA sequence from exon 17 and intron 17 of the KIT gene was 
determined and compared between animals with each of these three alleles. Figure 5 
shows that the / allele carries a splice site mutation at position 1 of intron 17. This G 
to A base substitution is present in one of the two gene copies carried on each 
chromosome. The base substitution occurs in the invariant GT dinucleotide which 
characterises 5' exon/intron boundaries. Analysis of the f allele showed the splice 
site mutation was not present in either the normal (KIT1) or duplicated copy of the 
gene (KIT2). We have found the splice site mutation is unique to the / alleles, and 
therefore makes it possible to distinguish the I-KIT2 sequences. 



Example 10 
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Testing For the Presence of the Splice Site Mutation with PCR RFLP 

To easily test for the presence of the G to A splice site mutation, restriction 
endonuclease Malll (CATG) was used to exploit the point substitution identified at 
position 1 of intron 1 7 (Figure 5). The Malll recognition sites in the fragment 
amplified from ATT and the expected restriction products are illustrated in Figure 6A 
and 6B respectively. 

L PCR to produce DNA for RFLP Test 

The PCR to produce DNA for RFLP analysis was performed exactly as described in 
example 9. 

ii. Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 175 bp in length. To test for polymorphism at 
position 1 of intron 1 7, digestion reactions were set up as below: 

3.0 |il PCR amplified DNA 
1 .0 jil 10 X NEBuffer 4 
0.1 pi BSA 100 pg/ml 

0.1 plA7aIII lOU/pl - ~ — 

5.8 fxl dH20 

» 

(1 X NEBuffer 4 (New England Biolabs) contains 50 mM potassium acetate, 20 mM 
Tris acetate, 10 mM magnesium acetate and 1 mM DTT). Following incubation at 
37°C for 90 minutes each 10 p.1 reaction volume had 2 fxl of loading dye added and 
the mix loaded on a 8% native polyacrylamide gel (Protogel, 37.5: 1 
acrylamide:bisacrylamide , National Diagnostics, Atlanta) in 0.5 X TBE (44.5 mM 
Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and electrophoresed for 3 hours 
at 200V in a vertical slab unit (SE600 Hoefer Scientific Instruments). Products were 
visualised by ethidium bromide staining. 



iii Results 
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A PCR RFLP protocol was designed to test for the presence of the splice site mutation 
as the substitution occurs within the recognition site for restriction endonuclease 
Malll. Figure 6B illustrates that presence of the G to A base substitution at position 1 
of KIT intron 17 results in restriction at each of two Nlalll recognition sites within the 
175 bp DNA fragment. Following electrophoresis, this results in fragments of sizes 80 
bp, 54 bp and 41 bp. Where the splice site mutation is absent however, incubation 
with Nlalll results in digestion only at recognition site 1. Following electrophoresis 
this results in fragments of 134 bp and 41 bp. The invariant Malll recognition site 1 
serves as an internal control to ensure complete digestion has taken place. Results of 
this PCR RFLP analysis are illustrated in Figure 6C. Analysis was performed on 
fragments amplified from clones which either carry the splice site mutation (lane 2) or 
carry the normal splice site sequence (lane 3). Lane 4 shows the result of analysis 
where DNA amplified from the genomic DNA of a coloured animal was used. Lane 5 
shows the resulting bands where a white animal was tested. The test was used to 
analyse 121 individuals from seven different breeds of pig. The splice site mutation 
was found only in the 97 animals with the dominant white phenotype (//- or /*//) and 
none of the 24 coloured {f or /) examples (see table below). This analysis confirms / 
and /* to be unique in that they are the only alleles to carry the splice site mutation. 

Distribution of the Splice Site Mutation Between Different Breeds and Coat 
Phenotype - 



Breed Coat Assumed Animals Normally spliced Splice 

Colour Genotype 1 Tested KIT 2 Mutation 2 



Large White 


white 


I/- 


33 


33 


33 


Landrace 


white 


I/- 


56 


56 


56 


Hampshire 


coloured 


i/i 


5 


5 


0 


Duroc 


coloured 


i/i 


5 


5 


0 


Pietrain 


coloured 


i/i 


8 


8 


0 


Meishan 


coloured 


i/i 


5 


5 


0 


Wild Boar 


coloured 


i/i 


1 


1 


0 


Wild Boar 


white 


I*/- 


8 


8 


8 



x Large 

White 

Totals 



45 

white I/- 89 89 89 

white I*/- 8 8 8 

coloured i/i 24 24 0 



1 White animals may be homozygous or heterozygous for the / allele 

2 Presence of the splice site mutation determined by Malll PCR RFLP test 

Example 1 1 

Quantification of Normal KIT and Splice Mutant KIT Hntron 17 ntl G ~» A ^ 
As the splice site mutation is present in only one of the duplicated regions of /and not 
in the duplicated region of f 9 the various genotypes can be expected to have the 
attributes described in the table below: 



Genotype Copies of Normal Copies of KIT Ratio of normal KIT to 

KIT containing the splice splice mutant KIT 

mutation 



I/I 2 2 1:1 

I/i 2 1 _ „ 2:1 

i/i 2 0 2:0 

W ? * 3 1 3:1 

I P /i 3 0 3:0 



Due to the dominance of allele /, three of the genotypes in the table are carried by white 
animals and therefore can not be identified by phenotypic characterisation. 
Quantification of the relative amounts of the normal KIT gene and the splice mutant 
ATT gene allows the ratio between the two to be calculated, and therefore the genotype 
of individual animals predicted. This was achieved by quantification of two DNA 
fragments following Malll digestion. The amount of 134 bp fragment, representative of 
the normally spliced KIT gene, and of 54 bp fragment, representative of the splice 
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mutant KIT, were measured following electrophoresis using GeneScan software. 

L PCR to Produce DNA for Quantification 

As described in example 9 section i. The reverse primer KIT35 is labelled with the ABI 
fluorescent dye F AM at the 5' end. 

ii Restriction Enzyme Digestion 
As described in example 9 section ii. 

in Electrophoresis and Quantification of DNA Fragments 

Following digestion, 0.5 jil of the reaction volume was mixed with 2.5 ^1 of deionised 
formamide, 0.5 jal of GS350 DNA standard (ABI) and 0.4 ^1 blue; dextran solution 
before being heated to 90°C for 2 minutes and rapidly cooled on ice. Three jol of this 
mix was then loaded onto a 377 ABI Prism sequencer and the DNA fragments 
separated on a 6% polyacrylamide gel in 1 X TBE buffer for 2 hours at 700 V, 40 mA, 
32 W. The peak area of fragments representative to both the normal and splice mutant 
forms of KIT were quantitated using the GeneScan (ABI) software. 

iv. Ratio Calculations 

The peak area value of the 134 bp fragment (normal KIT) was divided by twice the 
peak area value of the 54 bp fragment Xspiice~mutant KITJin order to calculate the ratio 
value for each sample. 

v. Results 

Analysis was performed on animals from the Swedish wild pig/Large White intercross 
pedigree for which genotypes at / have been determined by conventional breeding 
experiments with linked markers. Figure 7 and the table below show the ratio of normal 
to mutant KIT calculated for animals from each of the three genotype classes, I/I 
(expected ratio 1:1), I/i (expected ratio 2:1) an&I/f (expected ratio 3:1). The results are 
entirely consistent with the expected ratio values and indicate that the three genotype 
classes can be distinguished using this method. 

Rati oftheTwoKITF rms in Different Dominant White Genotypes in a Wild 
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Pig/Large White Intercross 



Genotype Phenotype 



Expected 

Ratio 
(Normal: 
Mutant) 



Observed Ratio Number 
(Normal:Mutarit) Tested 



±SE 



I/I 
I/I p 

m 



white 
white 
white 



1:1 
3:1 
2:1 



1.15 ±0.075 
3.11 ±0.084 
2.23 ±0.109 



13 
12 
14 



Figure 7 illustrates that the range of ratio values calculated for the two genotypes I/I 
and I/f do not overlap. This enables animals carrying the f allele to be identified and 
the frequency of the allele within different pig breeds determined. Ratio values were 
calculated for 56 Landrace and 33 Large White animals and the results are shown in 
Figure 8. A clearly bimodal distribution is observed with 7 Landrace and 3 Large 
White individuals having a ratio value of approximately 3 or above, suggesting them 
to be heterozygous carriers for the / allele (genotype I/f). This means f has gene 
frequency estimates of 6.25% (7/112 chromosomes tested) and 4.5% (3/66 
chromosomes tested) within the Landrace and Large White breeds respectively. 

EXAMPLE 12 " " 
* 

(i) DNA Preparation 

DNA can be prepared from any source of tissue containing cell nuclei, for example 
white blood cells, hair follicles, ear notches and muscle. The procedure outlined here 
relates to blood cell preparations; other tissues can be processed similarly by directly 
suspending material in K buffer and then proceeding from the same stage of the blood 
procedure. The method outlined here produces a cell ly sate containing crude DNA 
which is suitable for PCR amplification. However, any method for preparing 
purified, or crude, DNA should be equally effective. 



Blood was collected in 50 mM EDTA pH 8.0 to prevent coagulation. 50 jil of blood 
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was dispersed into a small microcentrifuge tube (0.5 ml Eppendorf or equivalent). 
450 |il of TE buffer was added to lyse the red blood cells (haem groups inhibit PCR) 
and the mix vortexed for 2 seconds. The intact white and residual red blood cells 
were then centrifuged for 12 seconds at 13,000 g in a microcentrifuge. The 
supernatant was removed by gentle aspiration using a low pressure vacuum pump 
system. A further 450 of TE buffer was then added to lyse the remaining red blood 
cells and the white blood cells collected by centrifugation as before. If any redness 
remained in the pellet, this process was repeated until the pellet was white. After 
removal of the last drop of supernatant from the pelleted white blood cells, 100 \il of 
K buffer containing proteinase K was added and the mixture incubated at 55°C for 2 
hours. The mixture was then heated to 95-1 00°C for 8 minutes and the DNA lysates 
stored at -20°C until needed. 

Reagents 

TE buffer: 10 mM TRIS-HC1 pH 8.0 

1 mM EDTA 
K buffer: 50 mM KC1 

10mMTRIS-HClpH8.3 

2.5 mM MgCl 2 

0.5% Tween 20 



Prior to use for lysates, 10 jal of 20 mg/ml proteinase K (Molecular Probes Inc.) per 
1 .0 ml of K buffer was added. 

(ii) PCR 

Reactions were set up as follows in thin walled 0.25 ml tubes (Perkin Elmer): 

4.0 jil 5 |iM CRC Forward primer; 
4.0 \\\ 5 |iM CRC Reverse primer; 
4.0 (il 5 |iM A7T1-REV primer; 
4.0 j^l 5 |iM A771-FOR primer; 
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4.0 ul 2 mM dNTPs (Pharmacia); 
4.0 ul 35 mM MgCl 2 . 

A wax bead (PCR Gem 50, Perkin Elmer) was added and the tube placed in a Perkin 
Elmer 9600 thermal cycler. The tube was then raised to 80°C for 15 seconds followed 
by cooling to 4°C. A second set of reagents was then added to each tube as below:- 

4.0 ill lOx buffer; 
9.6 jal sterile deionised water; 

0.4 (0.5 units) AmpliTaq DNA polymerase (Perkin Elmer); 
2 jil DNA lysate. 

Reaction tubes were then placed on a Perkin Elmer 9600 thermal cycler preheated to 
94°C and PCR carried out according to the regime indicated below:- 

94°C for 4 minutes; 

20 cycles of 94°C for 30 sees, 62°C for 30 sees and 72°C for 30 sees; 
0°C until required. 

The number of cycles may vary depending upon the tissue used as the DNA source. 
ATT primers 

Forward GAATATTGTTGCTATGGTGATCTCC A771-FOR 
Reverse CCGCTTCTGCGTGATCTTCCTG A7T1-REV 

CRC primers 

Forward CTGGATGTGCTGTGTTCCCTGT CRC-FORWARD 
Reverse AGGTTTGTCTGCAGCAGAAGCTC CRC-RE VERSE 

The reverse ATT primer and the forward CRC primer are labelled with the ABI 
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fluorescent dye FAM at the 5* end. 

(iii) Electrophoresis and Quantitation of DNA Fragments 

1 of the PCR was mixed with 2.5 fxl of deionised formamide, 0.5 pi of GS350 
DNA standards, 0.4 jal blue dextran solution, heated at 90°C for 2 minutes followed 
by rapid cooling on ice. 3 ^1 of this mix were then loaded onto an AB 13 73 DNA 
sequencer and DNA fragments separated on a 6% polyacrylamide gel in 1 x TBE 
buffer for 2 hours at 700 V, 40 mA, 32 W. The fragments corresponding to the 
products from the KIT and CRC genes were quantitated using GeneScan software, the 
peak area for each of the bands being determined. 

(iy) Results 

The data given in the table below represents the results obtained from an experiment 
in which DNA lysates were produced from each of 23 animals, with two PCR tests 
being carried out on each lysate. The ratio of ATTpleak area to CRC peak area was 
calculated for each PCR and the average taken of those samples from the same 
animal. 



Animal 


Genotype 


KIT/CRC peak 
area ratio 


1 1 


II 


3.25 


2 


Ii 


2.45 ' 


3. 


II 


2.94 


4 


ii 


1.16 


5 


ii 


1.34 


6 


ii 


1.20 


7 


Ii 


2.18 


8 


Ii 


2.19 


9 


II 


2.88 


10 


ii 


1.30 


11 


Ii 


1.84 




II 


2.84 




ii 


1.50 
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14 


ii 


1.30 


P— 


Ii 


2.07 


16 


ii 


1.31 


17 


ii 


1.14 


18 


Ii 


2.02 


19 


II 


1.87 


20 


Ii 


2.00 


21 


ii 


0.99 


22 


ii 


1.15 


23 


II 


2.80 



The upper and lower limits for the ratio values from animals of the different 
genotypes 77, Ii and ii in this experiment are as below: 

Genotype U pper Limit Lower Limit 

/// 3.25 2.80 

Hi 2.45 1.84 

/// 1.50 0.99 

These results illustrate differentiation of the genotypes using this test. 

EXAMPLE 13 

The second test utilises unique sequences of DNA that are present at one end of the 
duplication (or both ends if the duplicated region is reversed relative to the rest of 
the gene or if the duplicated region does not occur in direct tandem with the non- 
duplicated region). Oligonucleotide primers for use in PCR are designed such that 
at the annealing temperatures used in the PCR process, they will anneal only to the 
junction regions at the end of the duplicated region. A PCR is then carried out 
using two pairs of oligonucleotides. One pair consists of the aforementioned 
primer spanning the junction region and a second primer a suitable distance away 
which allows amplification to occur only from / allele containing duplication. The 
second pair of primers allow amplification of a sequence present only as a single 
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copy in the haploid genome. The product of this reaction, carried out in the same 
tube, functions as an internal standard as in the previous test. The ratio of product 
from the reaction specific to the junction region is measured relative to that from 
the single copy control sequence. 

In this test there is a larger difference between the predicted ratios of the products 
from the different genotypes. The relative levels of product and their ratios are 
illustrated below: - 

Junction Control 
Genotype Product Product Ratio 

II 2 2 1:1 

Ii 1 2 1:2 

ii 0 2 0:2 

These larger ratios allow greater differentiation between the ranges of results 
obtained from the different genotypes, reducing risks of miss-scoring animals. 

EXAMPLE 14 

(i) DNA Preparation 

DNA can be prepared as described in example 12. 



(ii)PCR 

Reactions were set up as follows in thin walled 0.25 ml tubes (Perkin Elmer): 

2.0 |il 5 mM A77DEL2-FOR primer; 
2.0 |xl 5 mM AY7DEL2-REV primer; 
1.0 nl 2 mM dNTPs (Pharmacia); 
1.2 ^1 25 mMMgC12 
2.0 pi 1 Ox buffer (without MgC12) 

0.1 nl (0.5 units) AmpliTaq DNA polymerase (Perkin Elmer); 

2.0 |ol DNA lysate; 

9.7 |xl sterile deionised water. 
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Reaction tubes were then placed on a Perkin Elmer 9600 thermal cycler and PCR 
carried out according to the regime indicated below:- 

95°Cfor 1 minute; 

3 cycles of 95°C for 15 sees, 50°C for 20 sees and 72°C for 40 sees; 
27 cycles of 94°C for 15 sees, 50°C for 20 sees and 72°C for 50 sees; 
72°C for 5 minutes; 
4°C until required. 

The number of cycles may vary depending upon the tissue used as the DNA 
source. 

KIT primers 

Forward GAAAGTGA(C/T)GTCTGGTCCTAT(G/G)GGAT A77DEL2- 

FOR 

Reverse AGCCTTCCTTGATCATCTTGTAG A77DEL2-REV 

(iii) Electrophoresis 

1 jil of the PCR product was mixed with 3 jxl loading buffer (95% deioiiised 
formamide, lOmM NaOH, 20mM EDTA, 0.05% bromophenolblue, 0.05% 
Xylene-cyanol), heated to 95 °C for 3 minutes followed by rapid cooling on ice. 
The sample was then loaded on an 8% native polyacrylamide gel (Protogel, 37.5:1 
Acrylamideibisacrylamide, National Diagnostics, Atlanta) in 1 x TBE buffer 
(89mM Tris, 89mM boric acid, 2mM EDTA.Na2). The DNA fragments were 
separated by electrophoresis for 4.5 hours at 6 W with a constant temperature of 
20°C and 0.6 x TBE as running buffer in a vertical slab unit (SE600 Hoefer 
Scientific Instruments, San Francisco). 

(iv) Visualisation of DNA fragments by silver staining 

After electrophoresis the gel was incubated, with gentle agitation, in the fix 
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solution for 20 minutes or until the tracking dyes were no longer visible. The gel 
was rinsed three times (2 minutes each with agitation) in deionised water. The gel 
was then incubated in the staining solution for 40 minutes, with gentle agitation, 
followed by a brief wash (5-10 seconds) in deionised water and direct transfer to 
the developing solution. The gel was incubated in the developing solution until 
bands were clearly visible and then the development was terminated by adding an 
equal volume of fix solution. Finally, the gel was rinsed for 2 minutes in 
deionised water. 

Reagents 

Fix solution: 10% glacial acetic acid in deionised 

water 

Staining solution: 2 g silver nitrate (AgN03) 

3 ml 37% formaldehyde 
2 litres deionised water 

Developing solution: 60 g sodium carbonate (Na2C03) dissolved in 2 liters 

deionised water. Immediately before use add 3 ml 37% 
formaldehyde and 400 ml sodium thio sulfate (10 mg/ml). 
The solution should be at a temperature of i 0-1 2 6 C when 
used. 

* 

(v) Results 

This SSCP analysis reveals an informative polymorphism so far only found in 
animals with the dominant white phenotype (Fig. 9); In lanes 1 to 8 the analysis 
was carried out on DNA from Swedish Landrace pigs carrying the dominant white 
colour and in lanes 9 and 10 DNA was from wild pigs of wild type colour. The 
polymorphic bands are indicated. The polymorphism is characterised by two 
unique fragments only present in animals carrying a duplicated KIT gene of allele 
type /. The fragments represent heteroduplexes of DNA strands from PCR products 
of unequal length representing the duplicated and non-duplicated copy of the KIT 
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gene. The results of a screening test with this marker using 40 unrelated animals 
representing five breeds and 190 F2 animals from a Large White /Wild pig 
intercross are presented in the table below: 



BREED 


COLOUR 


NO. OF 
ANIMALS 


HETERODUPLEX 








PRESENT 


NOT 

PRESENT 


SWEDISH 
LANDRACE 


WHITE 


10 


10 


0 


SWEDISH LARGE 
WHITE 


WHITE 


8 


8 


0 


SWEDISH 
HAMPSHIRE 


COLOURE 
D 


10 


0 


10 


SWEDISH 
DUROC 


COLOURE 
D 


10 


0 


10 


WILD PIG 


COLOURE 
D 


2 


0 


2 


LARGE WHITE/ 
WILD PIG 
INTERCROSS 


WHITE 
PATCH 
COLOURE 

D - 


131 
9 

50 


106 

0 

0 


25 

1 



The results show that this particular polymorphism is very closely associated with 
the presence of the ATT duplication. It is not completely associated with the 
duplication as some white animals did not show the heteroduplex pattern. The 
polymorphism is therefore an example of a closely linked genetic marker which by 
itself or in combination with other linked markers can be used to differentiate 
genotypes as regards the dominant white coat colour. 

EXAMPLE 15 



i) DNA extraction 
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DNA was prepared as in example 12. 



ii) PCR 

Reactions were set up in 0.25ml thin walled reaction tubes (Perkin Elmer) as 
follows: 

0.5fil 5 uM KTTDEL 1 -FOR primer 
0.5 ul 5 uM KITBEL 1 -REV primer 
l.Oul 2mM dNTPs (Pharmacia) 
\.0\i\ 15mMMgCl 2 
l.Oul 1 OX buffer 
4.9ul Sterile distilled water 
0.1(4.1 AmpliTaq DNA polymerase 
l.Ofj.1 DNA lysate 

Reaction tubes were then placed in a Perkin Elmer 9600 thermal cycler and PCR 
carried out according to the regime 

94°C for 4 minutes; 

2 1 cycles of 94°C for 30 sec, 60°C for 30 sec, and 72°C for 30 sec 
72°C for 4 min; 
- 4 9 e until required.- 

The number of cycles used may vary depending on the tissue used as the source of 
the DNA. 

Primers 

forward TGTGGGAGCTCTTCTCTTTAGG A77DEL1-FOR 

reverse CCAGCAGGACAATGGGAACATCT A77DEL1-REV 



The reverse primer was labelled with the ABI fluorescent dye FAM at the 5' end. 
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iii) Electrophoresis and quantitation of DNA fragments 

of the PCR was mixed with 1.5|il of deionised formamide, 0.25|il of GS350 
DNA standards, 0.25pJ loading buffer (50mg/ml blue dextran, 25mM EDTA) and 
heated at 90°C for two minutes followed by rapid cooling on ice. 1.75u,l of this was 
then loaded onto an ABI 377DNA sequencer and DNA fragments separated on a 
4.12% polyacrylamide gel in lxTBE buffer for two hours at 3000V, 60mA, 
200W and 48°C. The 97bp and 93bp fragments corresponding to the products from 
the ATT gene template lacking the deletion and containing the deletion respectively 
were quantitated using GeneScan software, the peak area for each of the bands 
being determined. 

Results 

The data given in the table below represents the results obtained from an 
experiment in which DNA lysates were produced from each of 20 animals of 
known genotype with one PCR test being carried out on each lysate. The ratio of 
the peak area of the product from the DNA template not containing the four base 
pair deletion to that containing the deletion was calculated. 



ANIMAL 


GENOTYPE 


Non del/del 
peak area ratio 


1 


// 


1.347 ' - — 


2 


// 


1.21 


3 


// 


1.33 


4 


// 


2.267 


5 


II 


0.444 


6 


II 


0.713 


7 


II 


8.387 


8 


II 


0.994 


9 


II 


1.673 


10 


II 


1.056 


11 


Ii 


1.751 


12 


It 


1.73 
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13 


li 


1.83 


14 


li 


0.631 


15 


H 


1975 


16 


T * 


2.147 


17 


// 


1.901 


18 


// 


1.749 


19 


// 


2.103 


20 


// 


2.026 



For this small sample the value of 1.5 which is midway between the predicted ratio 
values for each genotype (expected ratio=2 for // and 1 for II) might be used as the 
dividing line for scoring the animals to either genotype. It can be determined from 
the table that 7/10 Hand 9/10 li are identified as the correct genotype. 



Example 16 

Sequencing of KIT cDNA clones 

mRNA was isolated from peripheral blood leukocytes from white (Landrace/Large 
White) and coloured (Hampshire) pigs using the Message Maker mRNA isolation 
system (Gibco BRL) with one mRNA selection from total RNA. lOOng poly(A) + 
mRNA was reverse-transcribed with random primers i^irst-Strand cDNA Synthesis 
kit, Pharmacia Biotech) and the product was used at a 1:10 dilution for RT-PCR 
using the proof-reading Advantage KlenTaq Polymerase (Clontech) according to the 
manufacturer's recommendation. The following primers were used to amplify almost 
the entire coding sequence and some of the 5* untranslated region: KIT40 (5'-GGC 
TCT GGG GGC TCG GCT TTG C) corresponding to the 5 'untranslated region and 
KIT22S (5'- TCA GAC ATC TTC GTG GAG AAG CAG AGG) corresponding to 
exon 21; both primers had been designed using consensus sequence of the human 
and mouse KIT sequences in the GENBANK database. The RT-PCR products were 
gel purified and cloned using the pGEM-T vector system (Promega). Plasmid clones 
were sequenced using a set of internal primers and the ABI Prism™ dRhodamine 
Terminator Cycle Sequencing Kit (PE Applied Biosystems). Two subclones 



59 

representing each type of KIT sequence were initially sequenced and in those cases 
where a discrepancy was observed (possibly due to PCR errors) additional clones 
were sequenced over those particular nucleotide sites. RT-PCR analysis of ATT exon 
16-19 was carried out with the primers KIT1F (5'-TCR TAC ATA GAA AGA GAY 
GTG ACT C) and KIT7R (5'-AGC CTT CCT TGA TCA TCT TGT AG). 

Results 

The sequence of the KIT gene coding region derived from an animal of the 
Hampshire Breed is shown in Figure 10. Differences between ATTcDNA 
sequences cloned from a Hampshire and a Yorkshire/Landrace pig, respectively 
are shown in the table below. The sequence comparison includes the whole open 
reading frame 2919 bp, except for the last 27 bp occupied by the reverse PCR 
primer. Exon and base pair position number as well as amino acid codon are given 
for each difference. Polymorphic bases are shown in bold. A dash indicates identity 
with the Hampshire (/) allele. 
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Example 17 
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DNA preparation 

Genomic DNA was prepared as described in example 12. 
PCR 

A 158bp fragemnt covering 99bp of the end of exon 19 and 59bp of the KIT gene was 
amplified using forward primer LA93 (5'-GAG CAG GCC CTA CCC CGG AAT GCC 
AGT TGA-3') and reverse primer KIT56 (5'-CTT TAA AAC AGA ACA TAA AAG 
CGG AAA CAT CAT GCG AAG G-3'). PCR was carried out dn a Perkin Elmer 9600 
Thermal Cycler in a total volume of 20|il containing 25ng genomic DNA, 1.5mM 
MgCl 2 , 50mM Kcl, lOmM TrisrHCl, pH 8.3, 200|iM dNTPs, 0.5u AmpliTaq Gold 
(Perkin Elmer) and 10 pmol of both LA93 and KIT56 primer. To activate AmpliTaq 
Gold, initial heat denaturation was carried out at 94°C for 10 minutes followed by 32 
cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. 

Restriction digestion and electrophoresis 

The PCR amplification product is 158 bp in length. To test for polymorphism at 
position 93 of this product (corresponding to position 2678 of the KIT cDNA sequence) 
digestion reactions were set up and incubated as below: - 

* 

6.0^ PCR product 

1 .0^1 1 Ox reaction buffer 3 (New England Biolabs) 

0.2\xlAcil(Su/\xl) 

2.8jil deionised water 

Following digestion at 37°C for 120 minutes each lO^il reaction volume had 2|il of 
loading dye aded and the mix was loaded on an 8% native polyacrylamide gel 
(Protogel, 37.5:1 acrylamide:bisacrylamide, National Diagnostics, Atlanta) in 0.5 x 
TBE (44.5 mM Tris pH8.0, 44.5 mM boric acid and 0.5 mM EDTA) and 
electrophoresed for 3 hours at 200v in a vertical slab gel unit (SE600 Hoefer Scientific 
Instruments). Products are visualised by ethidium bromide staining. 
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Results 

The reverse primer is designed such that an Acii site is introduced into flie amplified 
sequence. This results in digestion of amplicon with Acii releasing a fragment of 23bp 
that allows confirmation of the digestion process. Digestion of the remaining 135 bp 
fragment into fragments of 92 and 43 bp is dependant on the nucleotide at the position 
corresponding to position 2678 of the KIT cDNA sequence. T at this position prevents 
digestion while a C at this position allows digestion. Gel resolution is not sufficient to 
allow resolution of the 23bp fragment but comparison to undigested product allows 
confirmation of the process. 



Figure 1 1 illustrates the results obtained with animals of a range of genotypes. 

The test was used to analyse a total of 66 unrelated individuals from seven breeds of 
pig. The results are shown in the table below: 



Breed 


No. 


KIT Genotype 1 


Genotype at pos'n 2678 










C/C 


C/T 


T/T 


Hampshire 


4 


i/i 


1 


1 


2 


" Polish Wild Boar 


13 


i/i 


0 


: r ""r 


" "12 


Duroc. 


11 


i/i 


0 


i 


10 


Pietrain 


1 


i/i 


0 


0 


1 


Swedish Wild Boar 


1 


i/i 


1 


0 


0 


Swedish Landrace 


12 


VI 


0 


0 


12 




5 


vr 


0 


2 


3 


Swedish Yorkshire 


14 


vi 


0 


1 


13 




5 


vr 


0 


1 


4 



1 Genotype based on Malll RFLP analysis as described in example 11. 
Example 18 
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Determination of genotype at the I locus using a rapid DNA based test 
Crude DNA lysates were prepared from hair samples from animals of three breeding 
lines, a Hampshire based line, a Large White line, and white animals from a cross bred 
line originally produced from the two former lines. Four hair follicles were placed into 
100^1 of K buffer (50mM KC1, lOmM Tris-HCl pH 8.3, 2.5mM MgCl 2 , 0.5% w/v 
Tween 20) and 1^1 Proteinase K (15 mg/ml) (Boehringer) added. This mix was 
incubated for 2 hours at 55°C followed by 16 min at 95°C. DNA was also prepared as 
described in example 12. 

Allelic discrimination reactions were set up using the PE Applied Biosystems 
TaqMan™ system. 25nl reactions contained the primers E19FOR (5- 
GAGCAGCCCCTACCCCGGAATGCC AGTTGA-3 ') and E19REV (5- 
CTTTAAAACAGAACATAAAAGCGGAAAGATCATGCGAAGG-3 , ) at 300nM, 
8% glycerol (w/v) IX TaqMan™ buffer A (PE Applied Biosystems), 5mM MgCl 2 , 
200^iM dATP, dGTP, dCTP and dUTP, 0.65 units AmpliTaq Gold™ (PE Applied 
Biosystems), 0.25 units AmpErase™ UNG (PE Applied Biosystems) and the 
TaqMan™ probes E19PC (5'- C ATAC ATTTCCGC AGGTGC ATGC-F AM) and 
E19PT (5'- TCATACATTTCCACAGGTGCATGC-TET) at a concentration of 
lOOmM. ljil of crude lysate DNA was used as template. PCR amplification was 
carried out using a PE9600 thermal cycler (PE Applied Biosystems) or a the ABI7700 
Pqsm (PE Applied Biosystems) with a thermal cycling regime of 50°C for '2 -min 
followed by 95°C for 10 min followed by 40 cycles of 95°C 15 sec, 62°C 1 min. 8 
control samples of each homozygote genotype, 2678C and 2678T, and 8 no template 
controls where deionized water was substituted for template controls were used per 96 
well plate. Allele identification based on these reactions was carried out using the 
allelic discrimination function of the ABI7700 Prism (PE Applied Biosystems). 

Results 

The test was used to analyse a total of 20 unrelated individuals from four breeds of pig. 
The results are shown in the table below: 

Breed No. Assumed KIT Genotype at pos'n 2678 

Genotype 
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C/C 


CAT 


T/T 


Hampshire 


5 


i/i 


1 


1 


3 


Landrace 


5 


I/I 


0 


0 


5 


Duroc 


5 


i/i 


0 


0 


5 


Pietrain 


1 


i/i 


0 


0 


5 



Example 19 

Complete cosegregation of the Belt coat colour locus and KIT 
Method 

Hampshire pigs have a characteristic coat colour phenotype with a white belt on a 
solid black background. Belt is determined by a dominant allele (Be). The segregation 
of the Belt locus was investigated in a backcross between Hampshire (Be/Be) and 
Pietrain (be/be) pigs. Fl sows (Be/be) were back-crossed to pure-bred Pietrain (be/be) 
boars. DNA preparations were carried out exactly as described in Example 3. 

KIT exon 19PCR RFLP 

i) PCR to produce DNA for the RFLP test 

A 158 bp fragment covering 99 bp of the 3' end of exon 19 and 59 bp of intron 19 of 
the KIT gene was amplified using the following primers: 
forward LA93 5' - GAGCAGCCCCTACCCCGGAATGCCAGTTGA -3' 
and reverse 

KIT56(5' -CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG -3'). 
PCR was carried out in a total volume of 20 \x\ containg 25 ng genomic DNA, 1.5 
mM MgCl 2 , 50 mMKCl, 10 mM tris-HCl, pH 8.3, 200 nM dNTPs, 0.5 U AmpliTaq 
Gold (Perkin Elmer) and 10 pmol of both LA93 and KIT56 primer. To activate 
Amplitaq Gold, initial heat denaturation was carried out at 94°C for 10 minutes 
followed by 32 cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 
72°C. 
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ii) Restriction enzyme digestion and electrophoresis 

The PCR amplification product is 158 bp in length. To test for polymorphism at 
position 93 of this product, digestion reactions were set up and incubated as follows: 

6.0 ill PCR amplified DNA 
1.0 ill 10 X NEBuffer 3 
0.2 \i\Aca (5 U/^il) 
2.8jxldH20 

(1 X NEBuffer (New England Biolabs) contains 100 mM sodium chloride, 50 mM 
Tris-HCl, 10 mM magnesium chloride, and 1 mM DTT). Following digestion at 37°C 
for 120 minutes, two ^il loading dye was added to each sample and the mix loaded on 
a 12 % native polyacrylamide gel in 0.5 % TBE (44.5 mM Tris pH 8 .0,44.5 boric 
acid and 0.5 mM EDTA) and electrophoresed for 3 hours at 200 V in a vertical slab 
unit. Products were visualised by Ethidium bromide staining. 

Results 

KIT nucleotide 2678 is polymorphic and a C or T occurs at this position. The 
presence of a C creates a restriction site for ^crl which is absent when a T is present. 
A second Acil site has been engineered into the reverse primer KIT56 to serve as an 
internal control of digestion and is therefore invariant. The polymorphism can be 
detected by a simple PCR-RFLP analysis as described in the table below. 

Detection of KIT single nucleotide polymorphism (SNP) at position 2678 

Size in bp of DNA 
Nucleotide fragments after digestion 

C 23 + 43 + 92 

T 23 + 135 



The cosegregation of the Belt and KIT loci in this pedigree is summarised in the table 
below. 
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Cosegregation between KIT and Belt in a Hampshire/Pietrain backcross 



Animal No tested Phenotype Belt locus KIT SNP2678 



Fl sows 


14 


Belt 


Be/be 


C/T 


Pietrain sires 


2 


non-Belt 


be/be 


T/T 


Offspring 


41 


Belt 


Be/be 


C/T 


Offspring 


41 


non-Belt 


be/be 


T/T 



The complete cosegregation between the Belt phenotype and the KIT polymorphism 
shows that this phenotype most likely is controlled by a mutation at the KIT locus. 
This means that detection of KIT polymorphism can be used to identify animal 
products derived from Hampshire pigs since the Belt is the most important breed 
determinant in Hampshire pigs. It is likely that the Belt phenotype present in 
Saddleback and Hannover-Braunschweig pigs is controlled by the same locus. 

Example 20 

Determination of the sequence of the 5* untranslated and 5' coding region of the 
aMSHR gene 

: The entire coding region of the aMSHR gene was determined and compared "Between 
pig breeds known to carry the different at the E locus, and EF. Hampshire 

carries Ef 1 and has a solid black body interrupted with a white belt. This belt is the 
result of another coat colour locus. The Wild Boar which carries allele E? has a 
wildtype phenotype while the Pietrain breed carries allele EF and is characterized by 
having black spots on a white body. 

PCR to produce DNA for clone construction 

The entire coding region of the aMSHR gene was amplified from genomic DNA 
using primers EPIG 1 0 and EPIG 1 6. These primers have sequence: 
EPIG10 5' - GGT CTA GAT CAC CAG GAG CAC TGC AGC ACC -3' 
EPIG16 5' - GGG AAG CTT GAC CCC CGA GAG CGA CGC GCC - 3' 
PCR was carried out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume 
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of 20 \il containing 25 ng genomic DNA, 1.5 mM MgCl 2 , 50 mM KC1, 10 mM Tris- 
HC1, pH 8.3, 200 jiM dNTPs, 5.0 % DMSO (dimethyl sulfoxide), 0.5 U AmpliTaq 
Gold (Perkin Elmer) and 10 pmol of both EPIG10 and EPIG16, To activate AmpliTaq 
Gold, initial heat denaturation was carried out at 96°C for 10 minutes followed by 32 
cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. The final 
extension lasted for 7 min at 72°C. 

Cloning of PCR products 

To facilitate cloning of PCR products, both primers were designed with restriction 
endonuclease recognition sites located at the 5' end. Primer EPIG10 has sequence 
TCTAGA which is cut using enzyme Xbal and EPIG16 contains sequence AAGCTT 
which is cut using enzyme HindllL Following PCR as described above, the entire 
reaction volume was electrophoresed and purified using the Qiaex II gel extraction kit 
following the manufacturers instructions (Qiagen). The purified PCR product was 



digested prior to ligation as follows: 

PCR product 17.0 jil 

S.OuHindill (Amersham) 1.0 ^il 

S.OuXbal (Amersham) 1-0 nl 

x 1 0 reaction buffer M (Amersham) 3 .0 |il 
xlO bovine serum albumin (Amersham) 3.0 pi 

H 2 0 5.0^ 



The reactions were incubated at 37 degrees C for 16 noiiS before the digested DNA 
was purified by passage through a QIAquick spin column following the 
manufacturers instructions (Qiagen). PCR products were ligated into 100 ng of vector 
pRc/CMV (Invitrogen) using 400 U T4 DNA ligase (New England Biolabs) in a total 
reaction volume of 20 ^1 containing 10 mM MgCh, 50 mM Tris-HCl, pH 7.5, 10 mM 
dithiothreitol, 1 mM ATP and 25 |ag/ml bovine serum albumin. Ligation reactions 
proceeded at 16 degrees for 16 hours. 

Preparati n and Sequencing of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
rriidi kit 50 (Genomed) and the resulting DNA diluted to 15 ng/jal. Dye terminator 
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cycle sequencing was performed using AmpliTaq DNA polymerase in accordance 
with the ABI Prism protocol P/N 402078 (perkin Elmer). Cycle sequencing reactions 
comprised 1:6 pmole of either T7 or SP6 sequencing primer (Promega), 15 ng of 
plasmid DNA and the terminator ready reaction mix (Perkin Elmer). The cycle 
sequencing reactions were performed in a GeneAmp 9600 machine over 25 cycles, 
each consisting of 10 sec at 96°C, 5 sec at 50°C and 4 min at 60°C. Extension 
products were purified for gel separation using ethanol precipitation, loaded and run 
on a 377 ABI Prism DNA sequencer as described by the instrument protocol (Perkin 
Elmer protocol P/N 402178). 

Results 

A 2bp insertion was identified in the aMSHR gene of pigs of the Pietrain breed which 
carry the E 3 allele between nucleotide positions equivalent to 66 and 67 in the Wild 
Boar aMSHR sequence. This results in a shift in the translation frame and creates a 
TGA stop codon at nucleotide positions equivalent to 161 to 163 in the Wild Boar 
aMSHR sequence. The 5' portion of the aMSHR coding sequence compared 
between three breeds is shown below. This comparison illustrates the two base pair 
insertion present within the alleles carried by the Pietrain animal when compared with 
either the Hampshire or Wild Boar alleles.. The ATG start codon is highlighted in 
bold, the 3' end of primer EPIG16 is shown in italics and bases in common with the 
Pietrain sequence are marked with a dash: Missing bases are marked with :. 

Pietrain CGACGCGCCC TCCCTGCTCC CTGGCGGGAC GATGCCTGTG CTTGGCCCGG 

Meishan - : 

Wild Boar ■ — ; 

Pietrain AGAGGAGGCT GCTGGCTTCC CTCAGCTCCG CGCCCCCAGC CGCCCCCCCC 

Meishan — 1 :: 

Wild Boar • ■ :: 

Pietrain GCCTCGGGCT GGCCGCCAAC CAGACCAACC AGACGGGCCC CCAGTGCCTG 

Meishan r — : 

Wild Boar ; — . — : •■ 

Pietrain GAGGTGTCCA TT 

Meishan 

Wild Boar ■ — — 

These results are also incorporated into figure la. 



Example 21 
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A rapid DNA test for the presence of the 2bp insertion mutation in the porcine 
aMSHR gene allowing rapid distinction of the E? allele from all other alleles 
identified at this locus 



PCR was conducted with forward primer EPIG16 (see above) and reverse primer 
MC1R121A exactly as described above. The reverse primer was labeled with ABI 
dye Hex and has sequence: 

MC1R121A 5' - Hex- GGA CTC CAT GGA GCC GCA GAT GAG CAC GGT - 

3V 

Following PCR cycling, 0.2 (il of the reaction volume was mixed with 2.5 j^l of 
deionised formamide, 0.5 |il of GS500 DNA standard (ABI) arid 0.4 jil blue dextran 
solution before being heated to 90°C for 2 minutes and rapidly cooled on ice. 1 of 
this mix was then loaded onto a 377 ABI Prism sequencer and the DNA fragments 
separated on a 6% polyacrylamide gel in 1 X TBE buffer for 2 hours at 700 V, 40 
mA, 32 W. The length of the resulting PCR products were determined using the 
GeneScan software (ABI). 

Results 

A test was devised to assay genomic DNA directly for the presence of the identified 2 
bp inserticii.-Primer3iEPIGi6 and"Tv4CiR12iA were used to PCR amplify 448 bp of 
the 5' portion of the wildtype porcine MC1R gene. To facilitate fluorescent detection 
of amplified products, the ABI dye HEX was covalently attached to the 5' end of 
primer MCI Rl 21 A. PCR was conducted on a number of unrelated individuals from 
three breeds and the resulting PCR products size determined using the GeneScan 
software (ABI). The results are presented in the table below: 



Breed Number Extension Size of PCR product 

tested Genotype 

■ , 448 bp 450 bp 

Pietrain 5 EFlEf 0 5 

Large White 3 ETIEP 0 3 

Hampshire 5 5 0 
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Analysis of the length of PCR products amplified between individuals showed either a 
448 bp or 450 bp product resulted. The expected 448 bp fragment was amplified from 
each Hampshire animal, however a product 2 bp longer wds detected from each 
Pietrain and Large White animal tested. This indicates the 2 bp insertion identified via 
sequence analysis to be present in the genomic DNA of Pietrain and Large White, two 
breeds ascribed the E? allele, but not in Hampshire which carries £*. 

EXAMPLE 22 

In addition to the coding region of the aMSHR gene, DNA sequence polymorphism 
may exist between breeds within the untranslated regions (UTR). Sequence 
information was collected from the 3' UTR and compared between six breeds of pigs 
which display a variety of coat color phenptypes. 

PCR to produce DNA for sequencing 

A 454 bp product containing 38 coding nucleotides from the 3' portion of the 
molecule and 416 bp of 3' untranslated region (not including primer binding sites) 
was amplified using primers EPIG13 and EPIG14. These primers have sequence: 
EP1G13 5' -GCA AGACCCTCC AGGAGGTG - 3' 

EPIG14 5' - CAC TGA GCC GTA GAA GAG AG - 3* 

PCR was carried out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume 
of 20 \A containing 25 ng genomic DNA, 1.5 mM MgCl 2 , 50 mM KC1, 10 mM Tris- 
HC1, pH 8.3, 200 \iU dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 pmol of 
both EPIG13 and EPIG14. To activate AmpliTaq Gold, initial heat denaturation was 
carried out at 96°C for 10 minutes followed by 32 cycles each consisting of 45 sec at 
94°C, 45 sec at 55°C and 45 sec at 72°C. The final extension lasted for 7 min at 72°C. 

Sequencing of PCR products 

PCR products were sequenced using dye terminator chemisty. This first requires 
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purification of PCR product free from excess dNTPs End primers. This was achieved 
by passage of the template DNA through a QIAquick spin column following the 
manufacturers instructions (Qiagen). Cycle sequencing reactions were performed on 
20 ng of purified template using either EPIG13 or EPIG14 as described in Example 1. 

Results 

Primers EPIG13 and EPIG14 were used to amplify a 454 bp region of DNA which 
comprises both the 3' terminal coding region of the MC1R gene and the immediately 
adjacent 3' UTR. The sequence information collected is displayed in figure 15 and 
two polymorphic positions were identified. 

The first polymorphism identified is a 1 bp deletion common to Meishan and Large 
Black which occurs seven positions downstream from the stop codon at the equivalent 
of nucleotide position 1007 in the Wild Boar sequence. As this deletion occurs outside 
the translated region it is not expected to alter the amino acid composition of the 
resulting receptor molecule, however its influence on mRNA stability and 3' end 
formation through endonucleolytic cleavage is unknown. It is unique to two breeds 
which carry Ef" and which carry a variant of the aMSHR gene not found in any other 
breed examined to date. 

The second polymorphism is a base substitution at nucleotide position 1162 unique to" 
thq European Wild boar. Figure 15 shows five breeds to have a G base at this position 
and the Wild boar to contain an A. This sequence difference offers the possibility to 
distinguish the European Wild boar from the other breeds analysed with a DNA based 
test. 



